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TITLE 

Sugarcane plant promoters to express heterologous n u cl e i c acids. 

FIELD OF THE INVENTION 

THIS INVENTION relates generally to isolate d nucleic acid 



S promoters for use in plant genetic engineering. More particularly, the present 
Invention relates to constitutive and tissue-specific promoters for expression of 
heterologous nucleic acids, such as foreign or endogenous coding sequences, in 
monocotyledonous plants. The invention is also concerned with a chimeric nucleic 
acid construct comprising a promoter of the invention operably linked to the 

10 heterologous nucleic acid, expression vectors, transformed plants, cells and tissues 
comprising an isolated promoter of the invention. 

BACKGROUND OF THE INVENTION 

A primary goal of genetic engineering is to obtain plants having 
improved chara cteristic s or traits. Many different types of characteristics or traits are 
1 5 considered advantageous, but those of particular importance include resistance to 
plant diseases, resistance to viruses or insects and resistance to herbicides. Other 
advantageous characteristics or traits include tolerance to cold or soil salinity, 
enhanced stability or shelf life of the ultimate consumer product obtained from a 
plant, or improvement in the nutritional value of edible portions of a plant 

20 Recent advances in genetic engineering have enabled the 

incorporation of a selected gene (or genes) into plant cells to impart a desired quality 
(or qualities) to a plant of interest. The selected gene (or genes) may be derived from 
a source different from the plant of interest or may be native to the desired plant, but 
engineered to have different or improved qualities. This new gene (or genes) may 

25 then be expressed in cells of the regenerated plant to exhibit the new trait or 
characteristic. 

In order for the newly incorporated gene to express the protein for 
which it codes in a plant cell, the proper regulatory signals must be present and in the 
proper location with respect to the gene. These regulatory signals include a promoter, 
30 a 5 1 non-translated leader sequence and a 3' polyadenylation signal. 
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Generally, the efficiency of gene expression is governed largely by 
the promoter used to express the gene. A promoter is a DNA sequence that directs 
the cellular machinery of a plant to produce (transcribe) RNA (transcript) from a 
contiguous transcribable region downstream (3*) of the promoter. The promoter 

5 influences the rate at which the transcript of the gene is made. Assuming the 
transcript includes a coding region with appropriate translational signals, the 
promoter also influences die rate at which the resultant protein product of the gene is 
produced. Promoter activity also can depend on the presence of several other ex- 
acting regulatory elements which, in conjunction with cellular factors, determine 

10 strength, specificity, and transcription initiation site (for a review, see Zawel and 
Reinberg, 1992, Curr. Opin. Cell Biol 4:488). 

It has been shown that certain promoters are able to direct RNA 
synthesis at a higher rate relative to other promoters. These are called "strong 
promoters". Certain other promoters have been shown to direct RNA production at 
15 higher levels only in particular types of cells or tissues and are often referred to as 
"tissue-specific promoters". Promoters that are capable of directing RNA production 
in many or all tissues of a plant are called "constitutive promoters". Thus, expression 
of a chimeric gene (or genes) introduced into a plant may potentially be controlled 
by identifying and using a promoter with the desired characteristics. 

20 A myriad of promoters has been described for gene expression in 

dicotyledonous plants. However, there is currently a dearth of promoters that can be 
used for effective expression of foreign or endogenous coding sequences in 
monocotyledonous plants. 

SUMMARY OF THE INVENTION 

25 Thus, in one aspect, the invention provides an isolated nucleic acid 

comprising a nucleotide sequence which corresponds to a promoter region of a 
transcribable DNA sequence that is hybridizable to a probe or primer derivable from 
a polynucleotide sequence selected from the group consisting of: - 

(a) the polynucleotide sequence set forth in FIG. 14 [SEQ ID NO: 1]; 
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(b) the polynucleotide sequence set forth in FIG. 15 under designator c51 [SEQ \ : 
ID NO: 2]; 

(c) the polynucleotide sequence set forth in FIG. IS under designator c51 1 [SEQ 
ID NO: 3]; 

5 (d) the polynucleotide sequence set forth in FIG. 15 under designator c512 [SEQ \ 

ID NO: 4]; 

(e) the polynucleotide sequence set forth in FIG. 16 [SEQ ID NO: 5]; 
(£) the polynucleotide sequence set forth in FIG. 17 [SEQ ID NO: 6]; 
(g) the polynucleotide sequence set forth in FIG. 23 [SEQ ID NO: 7]; 
10 (h) the polynucleotide sequence set forth in FIG. 24 [SEQ ID NO: 8]; 

(i) the polynucleotide sequence set forth in FIG. 26 [SEQ ID NO: 9]; 

(j) the polynucleotide sequence set forth in FIG. 29 [SEQ ID NO: 10]; and 

(k) the polynucleotide sequence set forth in FIG. 31 [SEQ ID NO: 11]. 

Advantageously, the isolated promoter region of the transcribable 
15 DNA sequence is of sufficient length such that it is capable of initiating and 
regulating transcription of a DNA sequence to which it is coupled. The promoter 
region may be between 100 bp and 4 kb in length and preferably greater than 1 kb in 
length. An analogous promoter region can be obtained from any plant species that 
has a DNA sequence that is transcribed in one or more cells or tissues of the plant 
20 provided that the DNA sequence is capable of hybridising to a nucleic acid probe 
derivable from a polynucleotide sequence as set forth in any one of SEQ ID NO: 1, 
2, 3, 4, 5, 6, 7, 8, 9, 10 and 11 under at least low stringency conditions, preferably 
under at least medium stringency conditions, and more preferably under high 
stringency conditions. 

25 The polynucleotide sequences set forth in SEQ ID NO: 1 , 2, 3, 4, 5, 6, 

7, 8, 9, 10 and 11 are transcribable sequences obtained from sugarcane (Saccharum 
sp.). The polynucleotide sequences identified by SEQ ID NO: 1, 2, 3, 4, 9, 10 and 11 
are highly transcribed in stem tissue, particularly mature stem tissue of sugarcane. 
Homologous sequences corresponding to SEQ ID NO: 2, 3 and 4 are expressed in 
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stem tissue of other monocotyledonous plants including maize and sorghum. . v; 
Homologous sequences corresponding to SEQ ID NO: 1, 9, 10 and 11 are also 1 
suspected to exist in other monocotyledonous plants. 

The invention thus also provides a polynucleotide sequence or variant XX 
5 thereof that is highly transcribed in stem tissue of monocotyledonous plants wherein 'i% 
said sequence is selected from the group consisting of any one of the polynucleotides ; • ; 
set forth in SEQ ID NO: 1, 2, 3, 4, 9, 10 and 11. ' ' - ^ 

The polynucleotide sequences identified by SEQ ID NO: 7 and 8 are - 
highly transcribed in meristem tissue of sugarcane. Thus, the invention also provides ■ /*/• 
10 a polynucleotide sequence or variant thereof that is highly transcribed in meristem 

tissue of monocotyledonous plants wherein said sequence is selected from the group 0, 
Xv)\ consisting of SEQ ID NO: 7 or SEQ ID NO: 8. 

The polynucleotides identified by SEQ ID NO: 5 and 6 are expressed 
constitutively in leaves, stems and roots of sugarcane. Homologous sequences V.'? 
15 corresponding to these polynucleotides are expressed in many or all tissues of other 
monocotyledonous plants including maize, rice and sorghum. 

Accordingly, the invention also features a polynucleotide sequence or 
variant thereof that is highly transcribed constitutively in tissue of 
monocotyledonous plants wherein said sequence is selected from the group 
20 consisting of: - 

(a) the polynucleotide sequence set forth in SEQ ID NO: 5; and 

(b) the polynucleotide sequence set forth in SEQ ID NO: 6. 

The foregoing polynucleotide sequences represent transcribable 
sequences, which can be used to construct a probe or primer(s) to isolate 
25 homologous transcribable sequences in other plant species, preferably, 
monocotyledonous species, so that a corresponding promoter region from said other 
plant species having the same tissue-specific or constitutive qualities can be isolated 
and used. .. ,./. 

Preferably, the isolated nucleic acid comprises a nucleotide sequence 
30 selected from the group consisting of: 
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(a) the polynucleotide sequence set forth in FIG. 18 [SEQ ID NO: 12]; 

(b) the polynucleotide sequence set forth in FIG. 19 [SEQ ID NO: 13]; 

(c) the polynucleotide sequence set forth in FIG. 20 [SEQ ID NO: 14]; 

(d) the polynucleotide sequence set forth in FIG. 21 [SEQ ID NO: 15]; 

5 (e) a biologically-active portion of any one of SEQ ID NO: 12, 13, 14 or 15; and 

(f) a variant of any one of the polynucleotide sequences according to (a) to (e). 

In one embodiment, the variant has at least 60%, preferably ait least 
80%, more preferably at least 90%, and most preferably at least 95% sequence 
identity to any one of the polynucleotides set forth in SEQ ID NO: 12, 13, 14 or 15. 
10 In another embodiment, the variant is capable of hybridising to any 

one of the polynucleotides identified by SEQ ID NO: 12, 13, 14 or 15 under at least 
low stringency conditions, preferably under at least medium stringency conditions, 
and more preferably under high stringency conditions. 

In another aspect, the invention provides a chimeric gene comprising 
15 an isolated nucleic acid of the invention, variant or biologically-active fragment and 
a heterologous nucleic acid. 

In yet another aspect, the invention provides an expression vector 
comprising an isolated nucleic acid of the invention, variant or biologically-active 
. fragment. 

20 Preferably, the expression vector comprises an isolated nucleic acid of 

the invention, variant or biologically-active fragment and a heterologous nucleic acid 
operably linked thereto. 

In one embodiment, the heterologous nucleic acid encodes a 
polypeptide such as a structural or regulatory protein. 

25 In another embodiment, the heterologous nucleic acid encodes a 

i molecule which is capable of modulating expression of a target gene. 

V In a preferred embodiment, the molecule is an antisense RNA or a 

• V ribozyme or other transcribed region aimed at downregulation of expression of the 

I corresponding target gene. For example, the said other transcribed region may 
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comprise a sense transcript aimed at sense suppression (co-suppression) of die target 
gene. 

Depending on the polynucleotide selected, in one embodiment, the 
expression construct may be further characterized in that said isolated nucleic acid is 

5 capable of directing transcription of the heterologous nucleic acid preferentially in 
stem tissue of a plant. In another embodiment, expression construct may be further 
characterized in that that said isolated nucleic acid is capable of directing 
transcription of the heterologous nucleic acid preferentially in meristem tissue of a 
plant. In yet another embodiment, the expression construct may be further 

1 0 characterized in that said isolated nucleic acid is capable of directing transcription of 
the heterologous nucleic acid in many or all tissues of a plant 

In a further aspect, the invention provides a method of transforming a 
plant, including the step of introducing into a plant cell or tissue an isolated nucleic 
acid, chimeric gene or expression vector of the invention. 

15 According to a still further another aspect of the invention, there is 

provided a transformed plant cell or tissue comprising an isolated nucleic acid, 
chimeric gene or expression vector of the invention. 

In a still yet further aspect, the invention provides a transformed plant 
comprising an isolated nucleic acid, chimeric gene or expression vector of the 
20 invention. 

Plants encompass any taxonomic grouping thereof^ including 
angiosperms, gymnosperms, monocotyledons and dicotyledons. Preferred plants are 
monocotyledons such as cereals, sugarcane, bananas and pineapples, but without 
limitation thereto. 
25 More preferably, the plant is sugarcane. 

Preferably, the transformed plant has an altered phenotype compared 
to a corresponding non-transformed plant 

Preferably, the altered phenotype results from expression of the 
heterologous nucleic acid. 
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The invention also provides cells, tissues, leaves, fruit, flowers, seeds 
and other reproductive material, material used for vegetative propagation, progeny 
plants including Fl hybrids, male-sterile plants and all other plants and plant 
products derivable from transgenic plants of the invention. 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

In order that the invention may be readily understood and put into 
practical effect, preferred embodiments will now be described by way of example 
with reference to the accompanying drawings in which: 

FIG. 1: Northern analysis of total RNA extracted from leaf (L), stem 
10 (S) and root (R) of field-grown sugarcane plants. The cultivars were Pindar, 
NCo310, QUO and Q145. The probe is the cDNA c67 (SEQ ID NO: 1), revealing 
preferential expression of gene 67 in the stems of all tested sugarcane cultivars. The 
relative expression levels for the leaf / stem / root tissues are 0.7 /100 / 5 (Pindar), 
0.3 / 100 / 1.7 (NCo310), 0.3 / 100 / 2.8 (QUO) and 0.7 / 100/2 (Q145). 

15 FIG. 2: Northern analysis of total RNA extracted from different 

internodes of field-grown sugarcane plants (cultivar Pindar). Each plant had a total 
of 17 internodes, INi corresponds to internode 1 (top of the stem) with other 
internodes numbered down the stem. The probe is the cDNA c67 (SEQ ID NO: 1), 
revealing preferential expression of gene 67 in mature internodes. The relative 

20 expression levels are 0 / 0.7 / 9.7 / 47 / 100 / 45 (INi / IN4/ IN 6 / INg / IN n / IN15). 

FIG. 3 : Northern analysis of total RNA extracted from leaf (L), stem 
(S) and root (R) of field-grown sugarcane plants. The cultivars were Pindar, 
NCo310, QUO and Q145. The probe is a Nhel DNA fragment from the cDNA c51 
(SEQ ID NO: 2, 3 and 4), revealing stem-preferential expression of gene 51 in all 
25 tested sugarcane cultivars. The relative expression levels for the leaf / stem / root 
tissues are 19.5 / 100 / 18 (Pindar), 17.5 / 100 / 23.7 (NCo310), 16.8 / 100 / 20 
(Ql 10) and 12.5 / 100 / 15.8 (Q145). 

FIG. 4: Northern analysis of total RNA extracted from different parts 
U of cultivar Pindar sugarcane plants with 17 internodes. IN6 and INu correspond 
30 respectively to internode 6 and 1 1. Nc is the core of a nodal part of the stem and N is 
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an entire nodal part of the stem. M corresponds to the meristematic region of the 
stem. R is the root tissue. YL, ML and OL represent different stages of leaf 
development, Y being the young and recently emerged leaves, M being the more 
mature leaves and O being leaves at an older stage. The probe is the cDNA cSl 
5 (SEQ ID NO: 2, 3 and 4), revealing expression of gene 51 throughout the sugarcane 
stem. 

FIG. 5: Northern analysis of total RNA extracted from different parts 
of field-grown maize plants and glasshouse-grown sorghum plants. The different 
tissues analysed were male flower parts (F), top of the stem (TS), medium and lower 
10 part of the stem (MS and LS), root system (R), leaves at a young, mature and older 
stage of development (YL, ML and OL) and panicle (P). The probe is the cDNA c5l, 
revealing expression of a homologue of sugarcane stem-specific gene 51 (SEQ ID 
NO: 2, 3 and 4) in maize and sorghum. 

FIG. 6: Northern analyses of total RNA extracted from different parts 
15 of field-grown sugarcane (cultivar Pindar) and maize (sweet corn) stems. For the 
sugarcane RNA, each plant had a total of 1 7 internodes, INi corresponds to internode 
1 (top of the stem), with other internodes numbered down the stan. For the maize 
RNA, each plant had a total of 13 internodes, INi corresponds to internode 1 (top of 
the stem), with other internodes numbered down the stem. The probe is the cDNA 
20 c51 (SEQ ID NO: 2, 3 and 4), revealing expression of gene 51 in internodes at all 
stages of maturity in both sugarcane and maize. The relative expression levels are 44 
/ 47 / 51 / 64 / 100 / 61 (INi / IN 4 / IN 6 / INg / INh / IN !5 ) in sugarcane stems and 40 / 
44/75/87/100/72 (IN1/IN3/IN5/IN7/ IN 9 /INn) in maize stems. 

FIG. 7: Northern analysis of total RNA extracted from various parts 
25 of field-grown sugarcane plants (cultivar Pindar). Each plant had a total of 17 
internodes. The tissues analysed were a nodal part of the stem (N), internodes 1-2 
(IN1-2, top of the stem), 6 and 11 (IN 6 and IN n ), the root system (R) and leaves at a 
young, mature and older stage of development (YL, ML and OL). The probe is the 
cDNA c32A (SEQ ID NO: 5 and 6), revealing expression of gene 32A in all tested 
30 sugarcane tissues. The relative expression levels are 42 / 100 / 92 / 94 / 59 / 43 / 70 / 
60(N/IN,. 2 /IN6/INii/R/YL/ML/OL). 
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FIG. 8: Northern analysis of total RNA extracted from leaf (L), stem 
(S) and root (R) of field-grown sugarcane plants. The cultivars were Pindar, 
NCo310, QUO and Q145. The probe is the cDNA c322 (SEQ ID NO: 5 and 6), 
revealing constitutive expression of gene 32A in all tested sugarcane cultivars. The 
5 relative expression levels for the leaf / stem / root tissues are 58 / 100 / 40 (Pindar), 
48 / 100 / 43 (NCo3 10), 88/100/58 (Ql 10) and 70 / 100 / 51 (Q145). 

FIG. 9: Northern analyses of total RNA extracted from different parts 
of field-grown sugarcane (cultivar Pindar) and maize (sweet com) stems. For the 
sugarcane RNA, each plant had a total of 17 intemodes, INi corresponds to intemode 

10 1 (top of the stem), with other intemodes numbered down the stem. For the maize 
RNA, each plant had a total of 13 intemodes, INi corresponds to intemode 1 (top of 
the stem), with other intemodes numbered down the stem. The probe is the cDNA 
c322 (SEQ ID NO: 5 and 6), revealing expression of gene 32A and its maize 
homologue in intemodes at all stages of maturity in both sugarcane and maize. The 

15 relative expression levels are 81 / 71 / 81 / 98 / 100 / 74.5 (INi / IN 4 / INfe / IN 8 / INn 
/ IN I5 ) in sugarcane stems and 74.5 / 94 / 97.5 /100 / 82 / 90 (INi / IN 3 / IN 5 / IN 7 / 
IN9 / INn) in maize stems. 

FIG. 10: Northern analysis of total RNA extracted from glasshouse- 
grown sorghum and rice plants. For the sorghum RNA, the different tissues analysed 
20 were the panicle (P), the top, middle part and lower parts of the stem (TS, MS and 
LS), the root system (R) and leaves at a young, mature and older stage of 
development (YL, ML and OL). For the rice RNA, the different tissues analysed 
were the young and older stems (S 2 and Si), the root system (R) and the young, 
mature and older leaves (L 3 , L 2 and L t ). The probe is the cDNA c322 (SEQ ID NO: 
25 5 and 6), revealing a constitutive expression of homologues of gene 32A in all the 
tissues tested in sorghum and rice. 

FIG. 11: Southern analysis of genomic DNA (10 jigflane) from 
cultivar Q117 digested by £coRI (E), Kpn\ (K), Sad (S), Xbal (X) and Notl (N), 
indicating genomic copies of gene 67 in sugarcane. The probe is the cDNA c67 
30 (SEQ ID NO: 1). Lane %m indicates the position of molecular weight markers on the 
same electrophoresis gel. 




WO 01/18211 PCT/AU00/01033 

10 

FIG. 12: Southern analysis of genomic DNA (10 ug/lane) fiom 
cultivar Q117 digested by EcdRI (E), Kpnl (K), SacI (S), Xba\ (X) and Notl (N), U 
indicating few genomic copies of gene 51 in sugarcane. The probe is the 3* region of [ ^ 
the cDNA c51 (nucleotide 621 to nucleotide 874 of SEQ ID NO: 2, 3 and 4). Lane ' fr; 
5 X H 3 indicates the position of molecular weight markers on the same electrophoresis V? }; 
gel. 

FIG. 13: Southern analysis of genomic DNA (10 ug/lane) fiom ; 
cultivar Q117 digested by EcdRI (E), Kpnl (K), Sad (S), Xbal (X) and M*I (N), 
indicating multiple copies of gene 32A in sugarcane. The probe is the cDNA c322 > 
10 (SEQ ID NO: 5 and 6). Lane Xr3 indicates the position of molecular weight maites|v' 
on the same electrophoresis gel. 

FIG. 14: Nucleotide sequence of the cDNA c67 (SEQ ID NO: 1) and )|y 
deduced amino acid sequence of the longest open reading frame (SEQ ID NO: 16). 5g 
The ATG start codon is at position 33 to 35, the stop codon (TGA) is at position 594 v 
15 to 596. No homologous sequences (with more than 40% identity on the whole 
sequence) were found in the databases (NR nucleic, ANGIS) at either the nucleotide 
or the amino acid level. Two putative polyadenylation signals (AATAAA) are 
present between nucleotide 853 and nucleotide 862. 

FIG. 15: Nucleotide sequence alignment of the cDNA clones c51, 
20 c51 1 and c5l2 (SEQ ID NO: 2, 3 and 4) and deduced amino acid sequence (aa) of . 
the longest and conserved open reading frames (SEQ ID NO: 17). The ATG start : 
codon has been assigned to position 129 to 131, the stop codon (TGA) is at position ? 
576 to 578. The cDNA clone c51 1 has a stop codon TAG at position 435 to 437. The ; 
alanine (A) residue deduced fiom nucleotide position 402-404 in the cDNAs c51 and 
25 c5 1 1 is changed to a valine residue (V) in the deduced amino acid sequence fiom the 
cDNA c5 12. The valine (V) residue deduced from nucleotide position 501-503 in the 
cDNAs c511 and c512 is changed to an alanine (A) residue in the deduced amino 
acid sequence of the cDNA c51. The nucleotide and deduced ainmo-acid sequences 
of all the cDNAs have no homologues in the nucleotide and protein databases (NR 
30 nucleic, ANGIS). 
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FIG, 16: Nucleotide sequence of cDNA clone c32A (SEQ ID NO: 5) 
and deduced amino acid sequence (SEQ ID NO: 22). The stop codon (TAG) is at 
position 44 to 46. This partial protein sequence (SEQ ID NO: 22) is homologous to 
S27 ribosomal proteins from rice, barley, Arabidopsis thaliana, Chlamydomonas 
5 reinhardtii and other organisms. 

FIG. 17: Nucleotide sequence and deduced amino acid sequence of 
the cDNA clone c322 (SEQ ID NO: 6 and 23, respectively). The ATG start codon is 
at position 92 to 94, the stop codon (TAG) is at position 350 to 352. The protein 
sequence (SEQ ID NO: 23) is homologous to S27 ribosomal proteins from rice, 
10 barley, Arabidopsis thaliana, Chlamydomonas reinhardtii and other organisms. 

FIG. 18: Nucleotide sequence of the 67pro promoter sequence (SEQ 
ID NO: 12). The start of the coding sequence is at position 1045. A putative "tata 
box" is present at position 964-969. The predicted start of transcription is at position 
994(A). 

15 FIG. 19: Nucleotide sequence of the 32a2pro promoter sequence 

(SEQ ID NO: 13). The start of the coding sequence is at position 1943. 

FIG. 20: Nucleotide sequence of the 32a6pro promoter sequence 
(SEQ ID NO: 14). The start of the coding sequence is at position 1266. A putative 
"tata box" is present at position 1 138-1 145. The predicted start of transcription is at 
20 position 1170(C). 

FIG. 21: Nucleotide sequence of the 51pro promoter sequence (SEQ 
ID NO: 15). The start of the coding sequence is at position 2946. 

FIG. 22: GUS histochemical assay of cross-sections of three 
transgenic sugarcane plant stems transformed with plasmid p67G. The strong blue 
25 staining (shown as dark grey to black on the figure) indicates the presence of the 
GUS protein and confirms the functionality of the promoter fused to the GUS gene. 

FIG. 23: Nucleotide sequence of the cDNA cl9 (SEQ ID NO: 7). 
. Neither the nucleotide sequence nor the deduced amino acid sequences (in the six 
frames) have hbmologues in the databases (NR nucleic and NR Proteins, ANGIS). 
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FIG. 24: Nucleotide sequence of the cDNA c3 (SEQ ID NO: 8). The 
sequence from nucleotide 144 to nucleotide 395 (deduced amino acid sequence SEQ 
ID NO: 21) is homologous to part of the cellulase (EC 3.2.1.4) coding sequence from 
a variety of sources (e.g. Arabidopsis thaliana, tomato, tobacco, pepper, termites and 
5 bacteria). 

FIG. 25: Northern analysis of total RNA extracted from different parts 
of field-grown sugarcane plants. The probe is the cDNA c!9 (SEQ ID NO: 7), 
revealing preferential expression in the meristematic region of the stem (M). No 
expression can be detected in other tissues such as the core region of a stem 
10 internode (INc), a whole stem internode (IN), the core region of a stem node (Nc), a 
whole stem node (N), the root system (R), young leaves (YL), mature leaves (ML) 
and older leaves (OL). An identical result is obtained (except for the size of the 
hybridising transcript) when using the cDNA c3 (SEQ ID NO: 8) as the probe. 

FIG. 26: Nucleotide sequence and deduced amino acid sequence of 
15 the cDNA cl8 (SEQ ID NO: 9 and 18, respectively). The ATG start codon is present 
at position 1-3 and the stop codon (TGA) is at position 244-246. The amino acid 
: sequence is homologous to type 2 plant metallothionein-like proteins previously 
described in different plants such as Arabidopsis thalianai tobacco, barley, fava bean 
and others. 

20 FIG. 27: Northern analysis of total RNA extracted from leaf (L), stem 

(S) and root (R) of field-grown sugarcane plants. The cultivars were Pindar, 
NCo310, QUO and Q145. The probe is the cDNA c!8 (SEQ ID NO: 9), revealing 
preferential expression of gene 18 in the stems of all tested sugarcane cultivars. The 
relative expression levels for the leaf / stem / root tissues are 6 / 100 / 8.5 (Pindar), 9 

25 / 100 / 10 (NCo310), 11.5 / 100 / 11.5 (QUO) and 6 / 100 / 5 (Q145). 

FIG. 28: Northern analysis of total RNA extracted from different 
internodes of field-grown sugarcane plants (cultivar Pindar). Each plant had a total 
of 17 internodes, INi corresponds to internode 1 (top of the stem) with other 
internodes numbered down the stem. The probe is the cDNA cl8 (SEQ ID NO: 9), 

30 revealing preferential expression in the young parts of the stem, with a gradual 
decrease in the expression levels towards the base of the stem (older parts). The 
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relative expression levels are 100 / 98 / 86 / 69 / 40 / 25 (IN| / IN 4 / IN 6 /IN 8 /INn/ 
IN, 5 ). 

FIG. 29: Nucleotide sequence of the cDNA c53A (SEQ ID NO: 10) 
and deduced amino acid sequence (SEQ ID NO: 19). The ATG start codon is at 
5 position 124-126, the stop codon TGA is at position 1069-1071. The protein 
sequence is homologous to homeobox proteins described in plants (e.g. Arabidopsis 
thaliana ATK1 protein and KNA2 knotted-like homeobox protein, soybean HMB1 
homeobox protein, maize RSI gene product, rice OSH1 homeobox 1 protein and 
others). 

10 FIG. 30: Northern analysis of total RNA extracted from leaf (L), stem 

(S) and root (R) of field-grown sugarcane plants. The cultivars were Pindar, Nco310, 
QUO and Q145. The probe is the cDNA c53A (SEQ ID NO: 10), revealing 
preferential expression of gene 53A in the stems of all tested sugarcane cultivars. 
The relative expression levels for the leaf / stem / root tissues are 4 / 100 / 14 

15 (Pindar), 5/100/12 (NCo310), 0 / 100 / 1 1 (Ql 10) and 0 / 100 / 8 (Q145). 

FIG. 31: Nucleotide sequence (SEQ ID NO: 11) and deduced amino 
acid sequence (SEQ ID NO: 20) of the cDNA c57. The stop codon (TGA): is at 
position 567-569. The amino acid sequence and the nucleotide coding sequence are 
homologous to several auxin-induced and auxin-responsive genes / proteins from 
20 different plants such as Arabidopsis thaliana, soybean and tobacco. 

FIG. 32: Northern analysis of total RNA extracted from leaf (L), stem 
(S) and root (R) of field-grown sugarcane plants. The cultivars were Pindar, Nco310, 
QUO and Q145. The probe is the cDNA c57 (SEQ ID NO: 11), revealing 
preferential expression of gene 57 in the stems of all tested sugarcane cultivars. The 
25 relative expression levels for the leaf / stem / root tissues are 0 / 100 / 0 (Pindar), 0 / 
100/0 (Nco310), 15 / 100 / 7 (QUO) and 15 / 100 / 12 (Q145). 

FIG. 33: Northern analysis of total RNA extracted from different 
internodes of field-grown sugarcane plants (cultivar Pindar). Each plant had a total 
of 17 internodes, INi corresponds to internode 1 (top of the stem) with other 
30 internodes numbered down the stem. The probe is the cDNA c57 (SEQ ID NO: 1 1), 
revealing preferential expression in the young parts of the stem, with a gradual 
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decrease in the expression levels towards the base of the stem (older parts). The 
relative expression levels are 100 / 86 / 82 / 65 / 45 / 20 (IN| / IN 4 / IN 6 / IN 8 / DSfi i / 
IN l5 ). 

BRIEF DESCRIPTION OF THE SEQUENCES 

5 SEQ ID NO: 1 Nucleotide sequence of cDNA clone c67, 

which is preferentially expressed in the stem of field-grown sugarcane plants. 

SEQ ID NO: 2 Nucleotide sequence of cDNA clone c51, 

which is preferentially expressed in the stem of field-grown sugarcane plants. 

SEQ ID NO: 3 Nucleotide sequence of cDNA clone c5 11, 

1 0 which is preferentially expressed in the stem of field-grown sugarcane plants. 

SEQ ID NO: 4 Nucleotide sequence of cDNA clone c512, 

which is preferentially expressed in the stem of field-grown sugarcane plants. 

SEQ ID NO: 5 Nucleotide sequence of cDNA clone c32A, 

which is transcribed constitutively in leaves, stems and roots of field-grown 
15 sugarcane plants. 

SEQ ID NO: 6 Nucleotide sequence of cDNA clone c322, 

which is transcribed constitutively in leaves, stems and roots of field-grown 
sugarcane plants. 

SEQ ID NO: 7 Nucleotide sequence of cDNA clone cl 9, 

20 which is preferentially expressed in meristematic tissue of field-grown sugarcane 
plants. 

SEQ ID NO: 8 Nucleotide sequence of the cDNA c3, which is 

preferentially expressed in meristematic tissue of field-grown sugarcane plants. 

SEQ ID NO: 9 Nucleotide sequence of cDNA clone cl8, 

25 which is preferentially expressed in the stem of field-grown sugarcane plants. 

SEQ ID NO: 10 Nucleotide sequence of cDNA clone c53A, 
which is preferentially expressed in the stem of field-grown sugarcane plants. 

SEQ ID NO: 11 Nucleotide sequence of cDNA clone c57, 
which is preferentially expressed in the stem of field-grown sugarcane plants. 
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SEQ ID NO: 12 Nucleotide sequence of the stem-specific 67 
promoter. E. coli DH10B strain transformed with plasmid pG4-67pro comprising the 
67 promoter was deposited with AGAL on August 30, 1999 under accession number 
NM99/05995. 

5 SEQ ID NO: 1 3 Nucleotide sequence of the stem-specific 32A2 

promoter. E. coli TOP10 strain transformed with pZ-32A2P2 comprising the 32A2 
promoter was deposited with AGAL on August 30, 1999 under accession number 
NM99/05994. 

SEQ ID NO: 14 Nucleotide sequence of the constitutive 32A6 
10 promoter. E. coli TOP10 strain transformed with pZ-32A6P2 comprising the 32A6 
promoter was deposited with AGAL on August 30, 1999 under accession number \ 
NM99/05993. 

SEQ ID NO: 15 Nucleotide sequence of the constitutive 51 
promoter. E. coli TOP10 strain transformed with pZ-H51-6PlP2 comprising the 51 
15 promoter was deposited with AGAL on August 30, 1999 under accession number, 
NM99/05992. 

SEQ ID NO: 16 Deduced amino acid sequence of the longest 
open reading frame of SEQ ID NO: 1. 

SEQ ID NO: 17 Deduced amino acid sequence of the longest 
20 and conserved open reading frame common to each of SEQ ID NO: 2, 3 and 4. 

SEQ ID NO: 18 Deduced amino acid sequence of the longest 
open reading frame of SEQ ID NO: 5. 

SEQ ID NO: 19 Deduced amino acid sequence of the longest 
open reading frame of SEQ ID NO: 10. 
25 SEQ ID NO: 20 Deduced amino acid sequence of the longest 

open reading frame of SEQ ID NO: 11. 

. SEQ ID NO: 21 Deduced amino acid sequence of the longest 
open reading frame of SEQ ID NO: 8. 
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SEQDDNO:22 Deduct amino acid sequence of the longest / 
open reading frame of SEQ ID NO: 5. 

SEQ ID NO: 23 Deduced amino acid sequence of the longest ; 
open reading frame of SEQ ID NO: 6. ; \\ 

5 DETAILED DESCRIPTION —ft 

/ ■' "V^y - i-^.l ";Y " ■ f<\'- ui . * \V-"' : " 

1. Definitions 

Unless defined otherwise, all technical and scientific terms used - 
herein have the same meaning as commonly understood by those of ordinary skill in 
the art to which the invention belongs. Although any methods and materials similar - • 

10 or equivalent to those described herein can be used in the practice or testing of the 

Y. ' v : v - " -if*'< ''• ■-<' •• . '*■ , 

present invention, preferred methods and materials are described.Tor the purposes of £ 

the present invention, the following terms are defined below. :-^< 

''Amplification product " refers to a nucleic acid product generated by • 
nucleic acid amplification techniques. 

15 By "biologically active fragment" is meant a segment, portion or 

fragment of a the biological active molecule which has at least about 0.1%, 
preferably at least about 10%, more preferably at least about 25% and even more 
preferably at least 50% of the activity of the molecule. 

"Chimeric gene" is defined herein as a nucleic acid, preferably aV". 
20 DNA molecule, either single- or double-stranded, which includes an isolated nucleic 
acid of the invention, variant or biologically-active fragment, operably linked to a 
heterologous nucleic acid. 

By "corresponds to" or "corresponding to" is meant similar, related 
or analgous to in structure and/or function. For example, an isolated nucleic acid of 
25 the invention suitably comprises a nucleotide sequence structurally and/or 
functionally related to a corresponding promoter sequence, siich that the isolated 
nucleic acid of the invention is capable of functioning as a promoter. 
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The term "regeneration " as used herein means growing a whole, 
differentiated plant from a plant cell, a group of plant cells, a plant part (including 
seeds), or a plant piece (e.g., from a protoplast, callus, or tissue part). 

By "heterologous nucleic acid" is meant a nucleic acid distinct from 
5 an isolated promoter of the invention. Operationally, the heterologous nucleic acid is 
, operably linked to an isolated promoter nucleic acid of the invention to achieve 
expression of the heterologous nucleic acid. The term heterologous nucleic acid 
V encompasses transcribable DNA as will be defined hereinafter. 

"Homology" refers to the percentage number of nucleotides of a 
10 polynucleotide sequence that are identical to a reference polynucleotide sequence. 
Homology may be determined using sequence comparison programs such as 
V BESTFTT (Deveraux et al. 1984, Nucleic Adds Research 12, 387-395) which is 
incorporated herein by reference. In this way sequences of a similar or substantially 
different length to those cited herein might be compared by insertion of gaps into the 
15 alignment, such gaps being determined, for example, by the comparison algorithm 
used by BESTFTT. 

"Hybridization" is used herein to denote the pairing of complementary 
nucleotide sequences to produce a DNA-DNA hybrid or a DNA-RNA hybrid. 
Complementary base sequences are those sequences that are related by the base- 
20 pairing rules. In DNA, A pairs with T and C pairs with G. In RNA U pairs with A 
and C pairs with G. In this regard, the terms "match" and "mismatch" as used herein 
refer to the hybridisation potential of paired nucleotides in complementary nucleic 
acid strands. Matched nucleotides hybridise efficiently, such as the classical A-T and 
/ : G-C base pair mentioned above. Mismatches are other combinations of nucleotides 
25 that do not hybridise efficiently. 

By "isolated " is meant material that is substantially or essentially free 
from components that normally accompany it in its native state. For example, an 
: "isolated nucleic acid", as used herein, refers to a nucleic acid, which has been 
purified from the sequences which flank it in a naturally occurring state, e.g. y a DNA 
30 fragment which has been removed from the sequences which are normally adjacent 
to the fragment 
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By "marker gene" is meant a gene that imparts a distinct phenotype 
to cells expressing the marker gene and thus allows such transformed cells to be 
distinguished from cells that do not have the marker. A selectable marker gene 
confers a trait for which one can 'selecf based on resistance to a selective agent (e.g. 9 
5 a herbicide, antibiotic, radiation, heat, or other treatment damaging to untransformed 
cells). A screenable marker gene (or reporter gene) confers a trait that one can 
identify through observation or testing, Le., by 'screening' (eg; green fluorescent 
protein or enzymes such as ^-glucuronidase, neomycin phosphotransferase H, and 
luciferase not present in untransformed cells). 

10 The term "nucleic acid 3 as used herein designates DNA and/or RNA 

including mRNA, cRNA, cDNA and genomic DNA in single-stranded and double- 
stranded form, and encompasses oligonucleotides and polynucleotides as herein 
defined. 

By "obtained from " is meant that a sample such as, for example, a 
15 nucleic acid extract is isolated from, or derived from, a particular source of the host. 
For example, the nucleic acid extract may be obtained from tissue isolated directly 
from the host 

The term "oligonucleotide" as used herein refers to a polymer 
composed of a multiplicity of nucleotide units (deoxyribonucleotides or 
20 ribonucleotides, or related structural variants or synthetic analogues thereof) linked 
via phosphodiester bonds (or related structural variants or synthetic analogues 
thereof). Thus, while the term "oligonucleotide" typically refers to a nucleotide 
polymer in which the nucleotides and linkages between than are naturally occurring, 
it will be understood that die term also includes within its scope various analogues 

25 including, but not restricted to, peptide nucleic acids (PNAs), phosphoramidates, 
phosphorothioates, methyl phosphonates, 2-O-methyl ribonucleic acids, and the like. 
The exact size of the molecule may vary depending on the particular application. An 
oligonucleotide typically comprises from about 10 to 30 nucleotides, but the term 
can refer to molecules of any length, although the term "polynucleotide" usually 

30 refers to nucleic acids of more than 100 nucleotides in length. 

By "operably linked 9 is meant functionally linked. For example, an 
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isolated nucleic acid of the invention is operably linked to a heterologous nucleic 
acid regulated by being linked thereto so as to be capable of initiating, controlling, 
regulating or otherwise directing transcription of the heterologous nucleic acid. 
Usually, the isolated nucleic acid is located upstream or 5* of the heterologous 
5 nucleic acid. 

"Polypeptide", "peptide" and "protein" are used interchangeably 
herein to refer to a polymer of amino acid residues and to variants and synthetic 
analogues of the same. Thus, these terms apply to amino acid polymers in which one 
or more amino acid residues is a synthetic non-naturally occurring amino acid, such 
10 as a chemical analogue of a corresponding naturally occurring amino acid, as well as 
to naturally-occurring amino acid polymers. 

By "prime?* is meant an oligonucleotide or polynucleotide which is 
capable of hybridizing to a complementary or partly complementary nucleic acid 
strand (template) and initiating synthesis of a primer extension product in the 
15 presence of a suitable polymerising agent The primer is preferably single-stranded 
for maximum efficiency in amplification but may alternatively be double-stranded. 
A prima: must be sufficiently long to prime the synthesis of extension products in 
the presence of the polymerisation agent. The length of the primer depends on many 
factors, including application, temperature to be employed, template reaction 
20 conditions, other reagents, and source of primers. For example, depending on the 
complexity of the target sequence, the oligonucleotide primer typically contains 15 
to 35 or more nucleotides, although it may contain fewer nucleotides. Primers can be 
large polynucleotides, such as from about 200 nucleotides to several kilobases or 
more. Primers may be selected to be "substantially complementary" to the sequence 
25 on the template to which it is designed to hybridise and serve as a site for the 
initiation of synthesis. By "substantially complementary*, it is meant that the primer 
is sufficiently complementary to hybridise with the template. Preferably, the primer 
contains no mismatches with the template to which it is designed to hybridise but 
this is not essential. For example, non-complementary nucleotides may be attached 
30 to the 5'-end of the primer, with the remainder of the primer sequence being 
complementary to the template. Alternatively, non-complementary nucleotides or a 
stretch of non-complementary nucleotides can be interspersed into a primer, 
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provided that the primer sequence has sufficient complementarity with the sequence 
of the template to hybridise therewith and thereby form a template for synthesis of 
the extension product of the prima-. 

"Probe " refers to a molecule that binds to a specific sequence or sub- 
5 sequence or other moiety of another molecule. Unless otherwise indicated, the term 
"probe" typically refers to a polynucleotide probe that binds to another nucleic acid, 
often called the 'target nucleic acid", through complementary base pairing. Probes 
may bind target nucleic acids lacking complete sequence complementarity with the 
probe, depending on the stringency of the hybridisation conditions. Probes can be 
10 directly or indirectly labelled. 

The term "promoter** refers to a nucleic acid which directs expression 
< of another nucleic acid to which it is operably linked, by initiating, regulating or 
otherwise controling transcription of said another nucleic acid. 

"Constitutive promoter" refers to a promoter that directs 
15 transcription in many or all tissues of a plant. 

By "stem-specific promoter" is meant a promoter that preferentially 
directs transcription in stem tissue of a plant. 

By "meristem-specific promoter" is meant a promoter that 
preferentially directs transcription in meristem tissue of a plant. 

20 The term "recombinant" as used herein refers to a nucleic acid or 

polypeptide resulting from in vitro manipulation into a form not normally found in 
nature. As used in the art, "recombinant* usually refers to the products of 
recombinant DNA technology. 

Terms used to describe sequence relationships between two or more 
25 polynucleotides or polypeptides include "reference sequence", "comparison 
window", "sequence identity**, "percentage of sequence identity" and "substantial 
identity". A "reference sequence*" is at least 6 but frequently 15 to 18 and often at 
least 25 monomer units, inclusive of nucleotides and amino acid residues, in length. 
Because two polynucleotides may each comprise (1) a sequence (i.e., only a portion 
30 of the complete polynucleotide sequence) that is similar between the two 
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polynucleotides, and (2) a sequence that is divergent between the two 
polynucleotides, sequence comparisons between two (or more) polynucleotides are 
typically performed by comparing sequences of the two polynucleotides over a 
"comparison window" to identify and compare local regions of sequence similarity. 
5 A "comparison window^ refers to a conceptual segment of typically 6 to 12 
contiguous residues that is compared to a reference sequence. The comparison 
window may comprise additions or deletions (Le. 9 gaps) of about 20% or less as 
compared to the reference sequence (which does not comprise additions or deletions) 
for optimal alignment of the two sequences. Optimal alignment of sequences for 
10 aligning a comparison window may be conducted by computerised implementations 
of algorithms (GAP, BESTFTT, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package Release 7.0, Genetics Computer Group, 575 Science Drive 
Madison, WI, USA, incorporated herein by reference) or by inspection and the best 
alignment (i.e., resulting in the highest percentage homology over the comparison 
15 window) generated by any of the various methods selected. Reference also may be 
made to the BLAST family of programs as for example disclosed by Altschul et al. 9 
1997, Nucl Acids Res. 25:3389, which is incorporated herein by reference. A 
detailed discussion of sequence analysis can be found in Unit 19.3 of Ausubei et al. 9 
"Current Protocols in Molecular Biology**, John Wiley & Sons Inc, 1994-1998, 
20 Chapter 15. 

The term "sequence identity" as used herein refers to the extent that 
sequences are identical on a nucleotide-by-nucleotide basis over a window of 
comparison. Thus, a "percentage of sequence identity" is calculated by comparing 
two optimally aligned sequences over the window of comparison, determining the 

25 number of positions at which the identical nucleic acid base (e.g:, A, T, C, G, I) 
occurs in both sequences to yield the number of matched positions, dividing the 
number of matched positions by the total number of positions in the window of 
comparison (i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. For the purposes of the present invention, 

30 "sequence identity" will be understood to mean the 4t match percentage" calculated 
by the DNASIS computer program (Version 2.5 for windows; available from Hitachi 
Software engineering Co., Ltd., South San Francisco, California, USA) using 
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standard defaults as used in the reference manual accompanying the software, which 
is incorporated herein by reference. 

"Stringency" as used herein, refers to the temperature and ionic 
strength conditions, and presence or absence of certain organic solvents, during ; ; 
5 hybridisation. Hie higher the stringency, the higher will be the degree of ; 
complementarity between immobilised nucleotide sequences and the labelled 
polynucleotide sequence that remain bound following hybridization. 

"Stringent conditions" refers to temperature and ionic conditions 
under which only nucleotide sequences having a high frequency of complementary 
10 bases will hybridise. The stringency required is nucleotide sequence dependent and 
also depends upon the various components present during hybridisation. Generally, 
' stringent conditions are selected to be about 10 to 20°C lower than the thermal -; 
melting point (T m ) for the specific sequence at a defined ionic strength and pH. The 
T m is the temperature (under defined ionic strength and pH) at which 50% of a target 
15 sequence hybridises to a complementary probe. 

The term "transcribable DNA sequence" or "transcribed DNA 
sequence", excludes the non-transcribed regulatory sequence that drives 
transcription. Depending on the aspect of the invention, the transcribable sequence 
may be derived in whole or in part from any source known to the art, including a 
20 plant, a fungus, an animal, a bacterial genome or episome, eukaryotic, nuclear or 
plasmid DN A, cDNA, viral DNA or chemically synthesized DNA. A transcribable 
sequence may contain one or more modifications in either the coding or the 
untranslated regions which could affect the biological activity or the chemical 
structure of the expression product, the rate of expression or the manner of 
25 expression control. Such modifications include, but are not limited to, insertions, 
deletions and substitutions of one or more nucleotides. The transcribable sequence 
may contain an uninterrupted coding sequence or it may include one or more introns, 
bound by the appropriate splice junctions. The transcribable sequence may also 
encode a fusion protein. It is contemplated that introduction into plant tissue of 
30 chimeric nucleic acid constructs of the invention will include constructions wherein 
the transcribable sequence and its promoter are each derived from different species. 
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The term "transformation" means alteration of genotype by 
introduction of genetic material into an organism. 

By "transgenic" is meant an organism that is transformed. 

By "transgenote" is meant an immediate product of a transformation 

5 process. 

The term "variant " is used in the context of a nucleic acid or protein 
which displays a definable level of sequence identity with a reference nucleic acid or 
protein respectively. For example, a variant nucleic acid is hybridizable with a 
reference sequence under stringent conditions that are defined hereinafter, or shares a 

10 \ percent level of sequence identity definable using a sequence comparison algorithm 
as hereinbefore described. Variants also encompass nucleic acids in which one or 
more nucleotides have been added or deleted, or replaced with different nucleotides 
or modified bases (eg. inosine, methylcytosine). In this regard, it is well understood 
in the art that certain alterations inclusive of mutations, additions, deletions and 

IS substitutions can be made to a reference nucleic acid whereby the altered 
polynucleotide retains the biological function or activity of the reference 
polynucleotide. The term "variant " also include naturally occurring allelic variants. 

By "vector** is meant a nucleic acid, preferably a DNA molecule 
derived, for example, from a plasmid, bacteriophage, or plant virus, into which a 
20 . nucleic acid sequence may be inserted or cloned. A vector preferably contains one or 
more unique restriction sites and may be capable of autonomous replication in a 
defined host cell including a target cell or tissue or a progenitor cell or tissue thereof, 
or be integratable with the genome of the defined host such that the cloned sequence 
; is reproducible. Accordingly, the vector may be an autonomously replicating vector, 
25 i.e.. a vector that exists as an extrachromosomal entity, the replication of which is 
V ! independent of chromosomal replication, e.g., a linear or closed circular plasmid, an 
extrachromosomal element, a minichromosome, or an artificial chromosome. The 
■!! vector may contain any means for assuring self-replication. Alternatively, the vector 
• 'j ; may be one which, when introduced into the host cell, is integrated into the genome 
30 and replicated together with the chromosome(s) into which it has been integrated. A 
• vector system may comprise a single vector or plasmid, two or more vectors or 
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plasmids, which together contain the total DNA to be introduced into the genome of 
the host cell, or a transposon. The choice of the vector will typically depend on the 
compatibility of the vector with the host cell into which the vector is to be 
introduced. The vector may also include a selection marker such as an antibiotic 
5 resistance gene that can be used for selection of suitable transfbrmants. Examples of 
such resistance genes are well known to those of skill in the art 

Throughout this specification, unless the context requires otherwise, 
the words "comprise'*, "comprises** and "comprising** will be understood to imply 
the inclusion of a stated integer or group of integers but not the exclusion of any 
10 other integer or group of integers. 

2. Transcribed DNA sequences 

The promoter regions of the present invention were discovered by 
their location adjacent to the start of DNA sequences that were found to be 
transcribed at high levels in one or more tissues of sugarcane (Saccharum sp.). A 
15 hybridisation screening procedure, as described hereinafter, was used to identify 
genes expressed differentially in various tissues. However, it will be understood that 
the present invention is not restricted to use of any particular method for identifying 
such differentially expressed genes. For example, alternative procedures for 
identifying genes expressed differentially in various tissues include, but are not 
20 restricted to: cDNA and genomic subtractive hybridisation as for example described 
by Bulman and Neill (1996, In "Plant Gene Isolation: Principles and Practice ", 
GJ>. Foster and D. Twell, eds Chichester, UK, Wiley, pp 369-397); multi-probe 
fluorescent analysis of microscopic cDNA arrays as for example described by 
Schena (1996 BioEssays 18:427-431); mRNA differential display as for example 
25 described by Liang and Pardee (1992, Science 257:967-970) and by Callard et al. 
(1994, BioTechniques 16:1096-1 103); computer analysis of mRNA abundance based 
on frequency of occurrence of identical sequences emerging from large-scale 
sequencing of cDNA ends (ESTs) as for example taught by Cooke et al (1996, EST 
and genomic sequencing projects. In Plant Gene Isolation: Principles and Practice, 
30 supra, pp. 410-419); or promoter tagging by insertional mutagenesis with 
promoterless reporter genes as for example disclosed by Lindsey and Topping (1996, 
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T-DNA-mediated insertional mutagenesis. In Plant Gene Isolation: Principles and 
Practice, supra, pp. 275-300) and Mudge and Birch (1998, Austral 7. Plant Physiol 
25:637-643), which are all incorporated herein by reference. 

2.1. Stem-specific transcribed sequences 
5 The invention further provides DNA sequences that are transcribed 

preferentially at high levels in stem tissue, as compared to other tissues, of sugarcane 
plants. Exemplary sequences of this type are set forth in SEQ ID NO: 1 , 2, 3 and 4. 

SEQ ID NO: 1 is a cDNA of 976 nucleotides in length. This sequence 
has a 5' untranslated region (5' UTR) situated at position 1 to 32 and a 3' 

10 untranslated region (3* UTR) at position 594 to 881. A poly(A) tail is present at the 
end of this sequence. Two putative polyadenylation signals (AATAAA) are present 
between nucleotide 853 and nucleotide 862. An open reading frame (ORF) is present 
from nucleotide 33 to nucleotide 593. The nucleotide sequence and the deduced 
amino acid sequence (FIG. 14, SEQ ID NO: 16) do not show any homologues in the 

15 databases with more than 40% overall sequence identity, 

SEQ ID NO: 2, 3 and 4 are homologous cDNA sequences (FIG. 15). 
The 5* UTRs are present from nucleotide position I to 128 (FIG. 15) and the 3* 
UTRs from nucleotide position 576 to the end of thib sequences. The three cDNA 
sequences have poly(A) tails and a putative polyadenylation signal (AATAAA) is 
20 present at nucleotide position 810 to 81 5 (FIG. 1 5). The longest ORF is present from 
nucleotide position 129 to 575 (FIG. 15). The deduced amino acid sequence (SEQ 
ID NO: 17) is also presented in FIG. 15. A mutation in SEQ ID NO: 3 at position 
435 (FIG. 15) introduced an in-frame stop codon, but the presence of the SEQ ID 
NO: 3 in the cDNA library indicates that it is still being transcribed from a functional 
25 promoter. Screening of the databases (nucleotide sequences and protein sequences) 
only revealed one similar sequence: EST Zm474 (accession number W49474) from 
maize. This maize EST is similar (83% sequence identity, 3 gaps) to the nucleotide 
sequence between nucleotide position 692 and 887 (FIG. 15), which is located within 
the 3* UTR. 
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SEQ ID NO: 9 is a partial cDNA of 597 nucleotides in length. An 
ORF is present from nucleotide positions 1 to 243 (FIG. 26). Its deduced amino acid 
sequence (SEQ ID NO: 18) is homologous to the type 2 plant metallothionein-like 
proteins. Examples of such proteins can be found in the nucleic acid and protein 
5 databases under accession numbers P30564 (castor bean), P43391 and P25860 
(Arabidopsis thaliana) and others. A 3* UTR is present from nucleotide position 244 
to the end of the sequence, which includes a poly(A), tail (FIG. 26). 

SEQ ID NO: 10 is a cDNA of 1339 nucleotides in length. An ORF is 
present from nucleotide positions 124 to 1068 (FIG. 29, SEQ ID NO: 19). A 5* UTR 

10 is present from nucleotide positions 1 to 123 and a 3* UTR is present from 
nucleotide positions 1069 to the end of the sequence (FIG. 29). A putative 
polyadenylation signal is present between nucleotide positions 1301 and 1306 
(TATAAA, FIG. 29). The amino acid sequence deduced from the ORF (SEQ ID 
NO: 19) is homologous to homeobox genes / proteins that can be found in the 

15 databases (nucleic acids and proteins) under such accession numbers as AAD1361 1 
(maize), G4887610 (rice), BAA76750 (tobacco) and others. 

SEQ ID NO: 11 is a cDNA of 1046 nucleotides in length. This 
sequence has a partial ORF from nucleotide positions 1 to 566 (FIG. 31). Its deduced 
amino acid sequence (SEQ ID NO: 20) is homologous to auxin-induced or auxin- 

20 responsive proteins found in the databases (nucleic acids and proteins) under such 
accession numbers as, for example, ATI 8409 and AT18416 {Arabidopsis thaliana), 
GMAUX28 (soybean), AF123509 and AF123508 (tobacco) and others. A 3'UTR is 
present from nucleotide position 567 to the end of the sequence (FIG. 31), and 
includes multiple putative polyadenylation signals such as AAAAAA and AATAAT, 

25 and a poly(A) tail. 

SEQ ID NO: 16 is the amino acid sequence deduced from the ORF in 
the nucleotide sequence corresponding to SEQ ID NO: 1 (FIG. 14) (see SEQ ID NO: 

1). 

SEQ ID NO: 17 is the amino acid sequence deduced from the ORF in 
30 the nucleotide sequence corresponding to SEQ ID NO: 2, 3 and 4 (FIG. 15) (see 
SEQ ID NO: 2, 3 and 4). 
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SEQ ID NO: 18 is the amino acid sequence deduced from the ORF in 
the nucleotide sequence corresponding to SEQ ID NO: 9 (FIG. 26) (see SEQ ID NO: 
9). 

SEQ ID NO: 19 is the amino acid sequence deduced from, the ORF in 
5 the nucleotide sequence corresponding to SEQ ID NO: 10 (FIG. 29) (see SEQ ED 
NO: 10). 

SEQ ID NO: 20 is the amino acid sequence deduced from the ORF in 
the nucleotide sequence corresponding to SEQ ID NO: 11 (FIG. 31) (see SEQ ID 
NO: 11). 

10 2.2. Meristem-specific transcribed sequences 

SEQ ID NO: 7 is a partial cDNA of 366 nucleotides in length (FIG. 
23). No homologues could be found in the nucleic acids databases. 

SEQ ID NO: 8 is a partial sequence of a cDNA. This sequence of 395 
nucleotides is homologous to cellulase (EC 3.2.1.4) nucleotide sequences such as the 
15 ones found in the nucleic acids databases under the accession numbers, for example, 
THFE4AA and CFICENBAA (bacteria), AF 128404 (tobacco) or SLU20590 
(tomato). Part of this nucleotide sequence (from nucleotide 144 to nucleotide 395, 
FIG. 24) can be translated into an amino acid sequence homologous to cellulase 
amino acid sequence (FIG. 24, SEQ ID NO: 21) 

20 SEQ ID NO: 16 is die amino acid sequence derived from an ORF in 

the nucleotide sequence of the SEQ ID NO: 1 (FIG. 14). This amino acid sequence 
has no homologues in the protein databases (see SEQ ID NO: 1). 

SEQ ID NO: 21 is the amino acid sequence deduced from the ORF in 
the nucleotide sequence corresponding to SEQ ID NO: 8 (FIG. 24) (see SEQ ID NO: 

25 8). 

2.3. Constitutive transcribed sequences 

The invention also features DNA sequences that are transcribed 
constitutively in leaves, stems and roots of sugarcane. Preferred sequences of this 
type are set forth in SEQ ID NO: 5 and 6. 
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SEQ ID NO: 5 is a partial cDNA which includes the end of the coding 
sequence firom nucleotide position 1 to 43, the 3* UTR from nucleotide position 44 
to 279 and a poly(A) tail (FIG. 16). SEQ ED NO: 5 is homologous to SEQ ID NO: 6. 

SEQ ID NO: 6 is homologous to SEQ ID NO: 5 and contains a 5* 
5 UTR firom nucleotide position 1 to 91, a coding sequence from nucleotide position 
92 to 349, a 3* UTR from nucleotide position 350 to 551 anil a poly(A) tail (FIG. 
17). The deduced amino acid sequences from SEQ ID NO: 5 and 6 (FIG. 16, SEQ ID 
NO: 22; FIG. 17, SEQ ID NO: 23) correspond to a ribosomal protein S27, with 
numerous examples of homologues found in the databases (examples can be found 
10 under accession numbers D231399 for a rice mRNA sequence, X85544 for a barley 
mRNA or L19739 for a human mRNA). A-^ y^M 

SEQ ID NO: 22 is the amino acid sequence deduced from the ORF in 
the nucleotide sequence corresponding to SEQ ID NO: 5 (FIG. 16) (see SEQ ID NO: 

15 SEQ ID NO: 23 is the amino acid sequence deduced from the ORF in 

the nucleotide sequence corresponding to SEQ ID NO: 6 (FIG. 17) (see SEQ ID NO: 

•6). 

2.4. Polynucleotide seqnence variants of transcribed DNA sequences 

In one embodiment, isolated nucleic acid variants of the invention 
20 may be prepared according to the following procedure: 

(i) obtaining a nucleic acid extract from a suitable tissue of a 
plant, preferably a monocotyledonous plant; 

(ii) creating primers which are optionally degenerate wherein each 
comprises a portion of a transcribable DNA sequence of the 

25 invention; and 

(Hi) using said primers to amplify, via nucleic acid amplification 
techniques, at least one amplification product from said 
nucleic acid extract, wherein said amplification product 
corresponds to a polynucleotide sequence variant. 
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Suitable nucleic acid amplification techniques are well known to the 
skilled addressee, and include polymerase chain reaction (PCR) as for example 
described in Ausubel et ai (supra) which is incorporated herein by reference; strand 
displacement amplification (SDA) as for example described in U.S. Patent No 

5 5,422,252 which is incorporated herein by reference; rolling circle replication (RCR) 
as for example described in Liu et al y (1996, J. Am. Chenu Soc. 118:1587-1594 and 
International application WO 92/01 813) and Lizardi et a/., (Intranational Application 
WO 97/19193) which are incorporated herein by reference; nucleic acid sequence- 
based amplification (NASBA) as for example described by Sooknanan et al. 9 (1994, 

10 Biotechniques 17:1077-1080) which is incorporated herein by reference; and Q-fJ 
replicase amplification as for example described by Tyagi et aL y (1996, Proc Natl 
Acad. ScL USA 93:5395-5400) which is incorporated herein by reference. 

An embodiment if this method emptying inverse PCR and primers 
according to SEQ ID NOS 24 and 25 will be described hereinafter. 

15 Typically, polynucleotide sequence variants that are substantially 

complementary to a reference polynucleotide are identified by blotting techniques 
that include a step whereby nucleic acids are immobilised on a matrix (preferably a 
synthetic membrane such as nitrocellulose), followed by a hybridisation step, and a 
detection step. Southern blotting is used to identify a complementary DNA sequence; 

20 northern blotting is used to identify a complementary RNA sequence. Dot blotting 
and slot blotting can be used to identify complementary DNA/DNA, DNA/RNA or 
RNA/RNA polynucleotide sequences. Such techniques are well known by those 
skilled in the art, and have been described in Ausubel et al. (1994-1998, supra) at 
pages 2.9.1 through 2.9.20. 

25 According to such methods, Southern blotting involves separating 

; DNA molecules according to size by gel electrophoresis, transferring the size- 
separated DNA to a synthetic membrane, and hybridising the membrane-bound 
' DNA to a complementary nucleotide sequence labelled radioactively, enzymatically 
or fluorochromatically. In dot blotting and slot blotting, DNA samples are directly 
30 applied to a synthetic membrane prior to hybridisation as above. 

An alternative blotting step is used when identifying complementary 
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polynucleotides in a cDNA or genomic DNA library, such as through the process of 
plaque or colony hybridisation. A typical example of this procedure is described in 
Sambrook et aL ("Molecular Cloning. A Laboratory Manual", Cold Spring Harbour 
Press, 1989) Chapters 8-12. 

S Typically, the following general procedure can be used to determine 

hybridisation conditions. Polynucleotides are blotted/transferred to a synthetic 
membrane, as described above. A reference polynucleotide such as a polynucleotide 
of the invention is labelled as described above, and the ability of this labelled 
polynucleotide to hybridise with an immobilised polynucleotide is analysed. 

10 A skilled addressee will recognize that a number of factors influence 

hybridisation. The specific activity of radioactively labelled polynucleotide sequence 
should typically be greater than or equal to about 10 8 dpm/mg to provide a detectable 
signal. A radiolabeled nucleotide sequence of specific activity 10* to 10 9 dpm/mg 
can detect approximately 0.5 pg of DNA. It is well known in die art that sufficient 

15 DNA must be immobilised on the membrane to permit detection. It is desirable to 
have excess immobilised DNA, usually 10 jig. Adding an inert polymer such as 10% 
• (w/v) dextran sulfate (MW 500,000) or polyethylene glycol 6000 during 
hybridisation can also increase the sensitivity of hybridisation (see Ausubel supra at 
2.10.10). 

20 To achieve meaningful results from hybridisation between a 

polynucleotide immobilised on a membrane and a labelled polynucleotide, a 
sufficient amount of the labelled polynucleotide must be hybridised to the 
immobilized polynucleotide following washing. Washing ensures that the labelled 
polynucleotide is hybridised only to the immobilized polynucleotide with a desired 

25 degree of complementarity to the labelled polynucleotide. 

It will be understood that polynucleotide sequence variants according 
to the invention will hybridise to a reference polynucleotide under at least low 
stringency conditions. Reference herein to low stringency conditions include and 
encompass from at least about 1% v/v to at least about 15% v/v formamide and from 

30 at least about 1 M to at least about 2 M salt for hybridisation at 42°C, and at least 
about 1 M to at least about 2 M salt for washing at 42°C. Low stringency conditions 
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also may include 1% Bovine Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHP0 4 
(pH 7.2), 7% SDS for hybridisation at 65°C, and (i) 2xSSC, 0.1% SDS; or (ii) 0.5% 
BSA, 1 mM EDTA, 40 mM NaHP0 4 (pH 7.2), 5% SDS for washing at room 
temperature. 

5 Preferably, the polynucleotide sequence variants hybridise to a 

reference polynucleotide under at least medium stringency conditions. Medium 
stringency conditions include and encompass from at least about 16% v/v to at least 
about 30% v/v formamide and from at least about 0.5 M to at least about 0.9 M salt 
for hybridisation at 42°C, and at least about 0.5 M to at least about 0.9 M salt for 
10 washing at 42°C. Medium stringency conditions also may include 1% Bovine Serum 
Albumin (BSA), 1 mM EDTA, 0.5 M NaHP0 4 (pH 7.2), 7% SDS for hybridisation 
at 65°C, and (i) 2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM 
NaHP0 4 (pH 7.2), 5% SDS for washing at 42°C. 

More preferably, the polynucleotide sequence variants hybridise to a 
15 reference polynucleotide under high stringency conditions. High stringency 
conditions include and encompass from at least about 31% v/v to at least about 50% 
v/v formamide and from at least about 0.01 M to at least about 0.15 M salt for 
hybridisation at 42°C, and at least about 0.01 M to at least about 0.15 M salt for 
washing at 42°C. High stringency conditions also may include 1% BSA, 1 mM 
20 EDTA, 0.5 M NaHP0 4 (pH 7.2), 7% SDS for hybridisation at 65°C, and (i) 0.2 x 
SSC, 0.1% SDS; or (ii) 0.5% BSA, ImM EDTA, 40 mM NaHP0 4 (pH 7.2), 1% SDS 
for washing at a temperature in excess of 65°C. 

Other stringent conditions are well known in the art. A skilled 
addressee will recognize that various factors can be manipulated to optimize the 
25 specificity of the hybridisation. Optimisation of the stringency of the final washes 
can serve to ensure a high degree of hybridisation. For detailed examples, see 
Ausubel et a/., supra at pages 2.10.1 to 2.10.16 and Sambrook et al (1989, supra) at 
sections 1.101 to 1.104, which are incorporated herein by reference. 

While stringent washes are typically carried out at temperatures from 
30 about 42°C to 68°C, one skilled in the art will appreciate that other temperatures may 
be suitable for stringent conditions. Maximum hybridisation typically occurs at about 
20°C to 25°C below the T m for formation of a DNA-DNA hybrid. It is well known in 
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the ait that the T ro is the melting temperature, or temperature at which two 
complementary polynucleotide sequences dissociate. Methods for estimating T m are 
well known in the art (see Ausubel et aL 9 supra at page 2.10.8). 

In general, the T m of a duplex DNA decreases by about 1°C with 

5 every increase of 1% in the number of mismatched base pairs. 

In a preferred hybridisation procedure, a membrane (e.£., a 
nitrocellulose membrane or a nylon membrane) containing immobilised DNA is 
hybridised overnight at 42°C in a hybridisation buffer (50% deionised formamide, 
5xSSC, 5x Denhardfs solution (0.1% ficoll, 0.1% polyvinylpyrollidone and 0.1% 

10 bovine serum albumin), 0.1% SDS and 200 mg/mL denatured salmon spam DNA) 
containing labelled probe. The membrane is then subjected to two sequential 
medium stringency washes (i.e., 2xSSC/0.1% SDS for 15 min at 45°C, followed by 
2xSSC/0.1% SDS for 15 min at 50*C), followed by two sequential high stringency 
washes (Le. 9 0.2xSSCV0.1% SDS for 12min at 55°C followed by 0.2xSSC and 

15 0.1%SDS solution for 12 min). 

Methods for detecting a labelled polynucleotide hybridised to an 
immobilised polynucleotide are well known to practitioners in the art. Such methods 
include autoradiography, phosphoiimaging, chemiluminescent, fluorescent and 
colorimetric detection. 

20 3. Promoter sequences of the invention 

3.1. Promoters regions of specific transcribed DNA sequences 

The invention also provides promoter regions isolated adjacent to the 
start of the transcribed DNA sequences described in Section 2. In particular, stem- 
specific and constitutive promoters for expression of chimeric or heterologous genes 
25 in plants, preferably monocotyledonous plants are provided. Preferred stem-specific 
promoters of the invention may be selected from the polynucleotide sequences set 
forth in SEQ ID NO: 12 or SEQ ID NO: 13. Preferred constitutive promoters of the 
invention include the polynucleotides set forth in SEQ ID NO: 14 or SEQ ID NO: 
15. 

30 The invention also contemplates biologically-active fragments of any 



WO 01/18211 



PCT/AUOO/01033 



33 

one of SEQ ID NO: 12, 13, 14 or IS as well as polynucleotide sequence variants 
thereof Those of skill in the art will understand that a biologically-active fragment 
of a promoter sequence, when fused to a particular gene and introduced into a plant 
cell, causes expression of the gene at a level higher than is possible in the absence of 
S such fragment The activity of a promoter can be determined by methods well known 
in the art For example, reference may be made to Medberry et ah (1992, Plant Cell 
4:185; 1993, The Plant J. 3:619, incorporated herein by reference), Sambrook et aL 
(1989, supra) and McPherson et al (U.S. Patent No. 5,164,316, incorporated herein 
by reference). 

10 3.2. Promoter variants 

Promoter variants that are substantially complementary to a reference 
promoter of the invention may be obtained by procedures outlined in Section 2.3. 

In general, variants comprise regions that show at least 70%, more 
suitably at least 80%, preferably at least 90%, and most preferably at least 95% 
15 sequence identity over a reference promoter sequence of identical size ^comparison 
window* 9 ) or when compared to an aligned sequence in which the alignment is 
performed by a computer homology program known in the art What constitutes 
suitable variants may be determined by conventional techniques. For example, 
polynucleotides according to SEQ ID NO: 12, 13, 14 and 15 can be mutated using 
20 random mutagenesis (e.#, transposon mutagenesis), oUgonucleotide-mediated (or 
site-directed) mutagenesis, PCR mutagenesis and cassette mutagenesis of an earlier 
prepared variant or non-variant version of an isolated natural promoter according to 
the invention. 

Oligonucleotide-mediated mutagenesis is a preferred method for 
25 preparing nucleotide substitution variants of a promoter of the invention. This 
technique is well known in the art as, for example, described by Adelman et al. 
(1983, DNA 2:183). Briefly, promoter DNA is altered by hybridising an 
oligonucleotide encoding the desired mutation to a template DNA, where the 
template is the single^stranded form of a plasmid or bacteriophage containing the 
30 unaltered or native DNA sequence of the promoter of interest. After hybridisation, a 
DNA polymerase is used to synthesise an entire second complementary strand of the 
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template (hat will thus incorporate the oligonucleotide primer, and will code for the . 
selected alteration in the promoter of interest 

Generally, oligonucleotides of at least 25 nucleotides in length are 
used. An optimal oligonucleotide will have 12 to IS nucleotides that are completely 
5 complementary to the template on either side of the nucleotides) coding for the 
mutation. This ensures that the oligonucleotide will hybridise properly to the single- 1 
stranded DNA template molecule. ' ' ^ - ^ 

The DNA template can be generated by those vectors that are either ; 
derived from bacteriophage M13 vectors, or those vectors that contain a single- 
10 stranded phage origin of replication as described by Viera et ah (1987, Methods ] 
EnzymoL 153:3). Thus, the DNA that is to be mutated may be inserted into one of 
the vectors to generate single-stranded template. Rwluctionyof single-stranded j) 
: template is described, for example, in Sections 4.21-4.41 of Sambrook et aL (1989, ' \ 
supra). 

15 Alternatively, the single-stranded template may be generated by 

denaturing double-stranded plasmid (or other DNA) using standard techniques. 

For alteration of the native DNA sequence, the oligonucleotide is 
hybridised to the single-stranded template under suitable hybridisation conditions. A 
DNA polymerising enzyme, usually the Klenow fragment of DNA polymerase I, is 

20 then added to synthesise the complementary strand of the template using the 
oligonucleotide as a primer for synthesis. A heteroduplex molecule is thus formed 
such that one strand of DNA encodes the mutated form of the promoter under test, 
and the other strand (the original template) encodes the native unaltered sequence of 
the promoter under test. This heteroduplex molecule is then transformed into a 

25 suitable host cell, usually a prokaryote such as E. coli. After the cells are grown, they 
are plated onto agarose plates and screened using the oligonucleotide primer having 
a detectable label to identify the bacterial colonies having the mutated DNA. The 
resultant mutated DNA fragments are then cloned into suitable expression hosts such 
as E. coli using conventional technology and clones that retain the desired promoter 

30 activity are detected. Where the clones have been derived using random mutagenesis 
techniques, positive clones would have to be sequenced in order to detect the 
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mutation. 

Alternatively, linker-scanning mutagenesis of DNA may be used to 
introduce clusters of point mutations throughout a sequence of interest that has been 
cloned into a plasmid vector. For example, reference may be made to Ausubel et al. 9 
5 supra 9 (in particular, Chapter 8.4,. incorporated herein by reference) which describes 
a first protocol that uses complementary oligonucleotides and requires a unique 
restriction site adjacent to the region that is to be mutagenised. A nested series of 
deletion mutations is first generated in the region. A pair of complementary 
oligonucleotides is synthesised to fill in the gap in the sequence of interest between 
10 the linker at the deletion endpoint and the nearby restriction site. The linker sequence 
actually provides the desired clusters of point mutations as it is moved or "scanned" 
across the region by its position at the varied endpoints of the deletion mutation 
series. An alternate protocol is also described by Ausubel et'aL % supra* which makes 
use of site directed mutagenesis procedures to introduce small clusters of point 
15 mutations throughout the target region. Briefly, mutations are introduced into a 
sequence by annealing a synthetic oligonucleotide containing one or more 
mismatches to the sequence of interest cloned into a single-stranded M13 vector. 
This template is grown in an Escherichia coli duf ung strain, which allows the 
incorporation of uracil into the template strand The oligonucleotide is annealed to 
20 the template and extended with T4 DNA polymerase to create a double-stranded 
heteroduplex. Finally, the heteroduplex is introduced into a wild-type R coli strain, 
which will prevent replication of the template strand due to the presence of apurinic 
sites (generated where uracil is incorporated), thereby resulting in plaques containing 
only mutated DNA. 

25 V Region-specific mutagenesis and directed mutagenesis using PCR 

may also be employed to construct promoter variants according to the invention. In 
this regard, reference may be made, for example, to Ausubel et aL 9 supra, in 
: particular Chapters 8.2A and 8.5, which are incorporated herein by reference. 

4. Chimeric DNA constructs and exp ression vectors 
30 An isolated nucleic acid promoter or variant according to the 

! invention can be fused to a heterologous nucleic acid to form a chimeric gene. The 
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heterologous nucleic acid may be a foreign or endogenous DNA sequence. For the 
purposes of transformation and expression of the heterologous nucleic acid in plants, 
it is preferred that the chimeric gene includes regulatory sequences which influence 
expression of the heterologous nucleic acid in plants. Preferably, the chimeric gene is 
5 present in an expression vector which includes regulatory sequences that enable 
selective propagation in bacteria. 

4.1 y Non-translated region 

A 3' non-translated sequence refers to that portion of a gene 
comprising a DNA segment that contains a polyadenylation signal and any other 
10 regulatory signals capable of effecting mRNA processing or gene expression. The 
polyadenylation signal is characterized by effecting the addition of polyadenylic acid 
tracts to the 3* end of the mRNA precursor. Polyadenylation signals are commonly 
recognized by the presence of homology to the canonical form 5' AATAAA-3' 
although variations are not uncommon. 

15 The 3* non-translated regulatory DNA sequence preferably includes 

from about SO to 1,000 nucleotide base pairs and contains plant transcriptional and 
% . translational termination sequences. Examples of suitable 3' non-translated 
- sequences are the 3' transcribed non-translated regions containing a polyadenylation 
signal from the nopaline synthase (nos) gene of Agrobacterium tumefaciens (Bevan 

20 et al.y 1983, Nucl Acid Res. 9 11:369) and the terminator for the 17 transcript from 
the octopine synthase gene of Agrobacterium tumefaciens. Alternatively, suitable 3' 
non-translated sequences may be derived from plant genes such as the 3' end of the 
protease inhibitor I or II genes from potato or tomato, the soybean storage protein 
genes and the pea E9 small subunit of the ribulose-l,5-bisphosphate carboxylase 

25 (ssRUBISCO) gene, although other 3* elements known to those of skill in the art can 
also be employed. Alternatively, 3' non-translated regulatory sequences can be 
obtained de novo as, for example, described by An (1987, Methods in Enzymology, 
153:292), which is incorporated herein by reference. 
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4>2 Optional sequences 

The chimeric DNA construct of the present invention can further 
include enhancers, either translation or transcription enhancers, as may be required. 
These enhancer regions are well known to persons skilled in the art, and can include 

5 the ATG initiation codon and adjacent sequences. The initiation codon must be in 
phase with the reading frame of the coding sequence relating to the foreign or 
endogenous DNA sequence to ensure translation of the entire sequence. The 
translation control signals and initiation codons can be of a variety of origins, both 
natural and synthetic. Translational initiation regions may be provided from the 

10 source of the transcriptional initiation region, or from the foreign or endogenous 
DNA sequence. The sequence can also be derived from the source of the promoter 
selected to drive transcription, and can be specifically modified so as to increase 
translation of the mRNA. 

Examples of transcriptional enhancers include, but are not restricted 
15 to, elements from the CaMV 35S promoter and octopine synthase genes as for 
example described by Last et ah (U.S. Patent No. 5,290,924, which is incorporated 
herein by reference). It is proposed that the use of an enhancer element such as the 
ocs element, and particularly multiple copies of the element, will act to increase the 
level of transcription from adjacent promoters when applied in the context of plant 
20 transformation. 

As the DNA sequence inserted between the transcription initiation site 
and the start of the coding sequence, Lc 3 the untranslated leader sequence, can 
influence gene expression, one can also employ a particular leader sequence. 
Preferred leader sequences include those that comprise sequences selected to direct 

25 optimum expression of the foreign or endogenous DNA sequence. For example, such 
leader sequences include a preferred consensus sequence which can increase or 
maintain mRNA stability and prevent inappropriate initiation of translation as for 
example described by Joshi (1987, Nucl Acid Res., 15:6643), which is incorporated 
herein by reference. However, other leader sequences, cg. y the leader sequence of 

30 RTBV, have a high degree of secondary structure that is expected to decrease mRNA 
stability and/or decrease translation of the mRNA. Thus, leader sequences (i) that do 
not have a high degree of secondary structure, (ii) that have a high degree of 
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secondary structure where the secondary structure does not inhibit mRNA stability 
and/or decrease translation, or (iii) that are derived from genes that are highly 
expressed in plants, will be most preferred. 

Regulatory elements such as the sucrose synthase intron as, for example, 
5 described by Vasil et al (1989, Plant Physiol, 91:5175), the Adh intron I as, for 
example, described by Callis et al. (1987, Genes Develop., II), or the TMV omega 
element as, for example, described by Gallie et al (1989, The Plant Cell, 1:301) can 
also be included where desired. Other such regulatory elements useful in the practice 
of the invention are known to those of skill in die art 
10 Additionally, targeting sequences may be employed to target a protein 

product of the foreign or endogenous DNA sequence to an intracellular compartment 
within plant cells or to the extracellular environment For example, a DNA sequence 
• encoding a transit or signal peptide sequence may be operably linked to a sequence 
encoding a desired protein such that, when translated, the transit or signal peptide 
15 can transport the protein to a particular intracellular or extracellular destination, 
respectively, and can then be post-translationally removal. Transit or signal peptides 
act by facilitating the transport of proteins through intracellular membranes, eg-., 
vacuole, vesicle, plastid and mitochondrial membranes, whereas signal peptides 
direct proteins through the extracellular membrane. For example, the transit or signal 
20 peptide can direct a desired protein to a particular organelle such as a plastid (e.g., a 
chloroplast), rattier than to the cytoplasm. Thus, the chimeric DNA construct can 
further comprise a plastid transit peptide encoding DNA sequence operably linked 
between a promoter region or promoter variant according to the invention and the 
foreign or endogenous DNA sequence. For example, reference may be made to 
25 Heijne et al (1989, Eur. J. Biochenu, 180:535) and Keegstra et al. (1989, Ann. Rev. 
Plant Physiol Plant Mol Biol, 40:471), which are incorporated herein by reference. 

A chimeric DNA construct can also be introduced into a vector, such as 
a plasmid. Plasmid vectors include additional DNA sequences that provide for easy 
selection, amplification, and transformation of the expression cassette in prokaryotic 
30 and eukaryotic cells, e.g. , pUC-derived vectors, pSK-deri ved vectors, pGEM-derived 
vectors, pSP-derived vectors, or pBS-derived vectors. Additional DNA sequences 
include origins of replication to provide for autonomous replication of the vector, 
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selectable marker genes, preferably encoding antibiotic or herbicide resistance, 
unique multiple cloning sites providing for multiple sites to insert DNA sequences or 
genes encoded in the chimeric DNA construct, and sequences that enhance 
transformation of prokaryotic and eukaryotic cells. 
5 The vector preferably contains an element(s) that permits stable 

integration of the vector into the host cell genome or autonomous replication of the 
vector in the cell independent of the genome of the cell. The vector may be 
integrated into the host cell genome when introduced into a host cell. For integration, 
the vector may rely on the foreign or endogenous DNA sequence or any other 
10 element of the vector for stable integration of the vector into the genome by 
homologous recombination. Alternatively, the vector may contain additional nucleic 
acid sequences for directing integration by homologous recombination into the 
genome of the host cell. The additional nucleic acid sequences enable the vector to 
be integrated into the host cell genome at a precise location in the chromosome. To 
15 increase the likelihood of integration at a precise location, the integrational elements 
should preferably contain a sufficient number of nucleic acids, such as 100 to 1,500 
base pairs, preferably 400 to 1,500 base pairs, and most preferably 800 to 1,500 base 
pairs, which are highly homologous with the corresponding target sequence to 
enhance the probability of homologous recombination. The integrational elements 
20 may be any sequence that is homologous with the target sequence in the genome of 
the host cell. Furthermore, the integrational elements may be non-encoding or 
encoding nucleic acid sequences. 

For autonomous replication, the vector may further comprise an origin of 
replication enabling the vector to replicate autonomously in the host cell in question. 
25 Examples of bacterial origins of replication are the origins of replication of plasmids 
pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. coli, and 
pUBHO, pE194, pTA1060, and pAM.beta.1 permitting replication in Bacillus. The 
origin of replication may be one having a mutation to make its function temperature- 
sensitive in a Bacillus cell (see, eg., Ehrlich, 1978, Proc. Natl. Acaa\ ScL USA 
30 75:1433). 

43 Marker genes 

To facilitate identification of transformants, the chimeric DNA 
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construct desirably comprises a selectable or screenable marker gene as, or in . . 'ttiy^^iJ^t 
addition to, the expressible foreign or endogenous DNA sequence. The actual choice V , ; >•? V':- :v;: 

of a marker is not crucial as long as it is functional (Le., selective) in combination v f 
with the plant cells of choice. The marker gene and the foreign or endogenous DNA ( ; ( • ! ■ .">Y£ •'.[ v' •) -'vV-k.- 
5 sequence of interest do not have to be linked, since co-transformation of unlinked ; 0/;^//^^ 
genes as, for example, described in U.S. Pat No. 4,399,216 is also an efficient /ixVA/^ 
{v ? , .': process in plant transformation- /;V; : > ••'^> ' < "-'' v ' 

Included within the terms selectable or screenable marker genes are 
genes that encode a "secretable marker? whose secretion can be detected as a means ' 
10 of identifying or selecting for transformed cells. Examples include markers that ; / 

encode a secretable antigen that can be identified by antibody interaction, m •i^/^/\;/k<: <s t ■ 
secretable enzymes that can be detected by their catalytic activity. Secretable 
• / proteins include, but are not restricted to, proteins that are inserted or trapped in the ffe ' >a^<* >*iU!i\ { >;\% i ^ 

cell wall (e.g. 9 proteins that include a leader sequence such as that found in the v: V v Y- v r S>< v ://'/'- 
15 expression unit of extensin or tobacco PR-S); small, diffusible proteins detectable, ; ^fW^W 
eg. by ELJSA; and small active enzymes detectable in extracellular solution (e.#., a- 
amylase, p-lactamase, phosphinothricin acetyltransferase). 

4.3.1 Selectable markers 

Examples of bacterial selectable markers are the dal genes from 
20 Bacillus subtilis ox Bacillus licheniform 

such as ampicillin, kanamycin, erythromycin, chloramphenicol or tetracycline 
resistance. Exemplary selectable markers for selection of plant transformants ^ yy '^y; \:'y"y 
include, but are not limited to, a hyg gene which encodes hygromycin B resistance; a /-V^^^.K- 
neomycin phosphotransferase (neo) gene conferring resistance to kanamycin, 
25 paromomycin, G418 and the like as, for example, described by Potrykus e* a/. (1985, 

MoL Geru Genet 199:183); a glutathione-S-transferase gene from rat liver , •.. 
conferring resistance to glutathione derived herbicides as, for example, described in 
EP-A 256 223; a glutamine synthetase gene conferring, upon overexpression, .\.-y;h]':^ ; 
resistance to glutamine synthetase inhibitors such as phosphinothricin as, fiv ^.y^;::'^;., \ v"; 8 .-' 
• 30 \ example, described WO87/05327, an acetyl transferase gene from Streptomy<^^^y^:\ ■ ■ ] : Y \ yi 
viridochromogenes conferring ^stance to the selective agent phosphinothricin as, ^^^^^^^^.^ . v ^ 
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for example, described in EP-A 275 957, a gene encoding a 5-enolshikimate-3- 
phosphate synthase (EPSPS) conferring tolerance to N-phosphonomethylglycine as, 
for example, described by Hinchee et al. (1988, Biotech.* 6:915), a bar gene 
conferring resistance against bialaphos as, for example, described in W09 1/02071; a 

5 nitrilase gene such as bxn from Klebsiella ozaenae which confers resistance to 
bromoxynil (Stalker et al, 1988, Science, 242:419); a dihydrofblate reductase 
: (DHFR) gene conferring resistance to methotrexate (Thillet et al., 1988, J. Biol 
ChenL, 263:12500); a mutant acetolactate synthase gene (ALS), which confers 
resistance to imidazolinone, sulfonylurea or other ALS-inhibiting chemicals (EP-A- 

10 154 204); a mutated anthranilate synthase gene that confers resistance to 5 -methyl 
tryptophan; or a dalapon dehalogenase gene that confers resistance to the herbicide. 

43.2 Screenable markers 

Preferred screenable markers include, but are not limited to, a uidA 
gene encoding a p- glucuronidase (GUS) enzyme for which various chromogenic 
15 substrates are known; a p-galactosidase gene encoding an enzyme for which 
chromogenic substrates are known; an aequorin gene (Prasher et al., 1985, Biochem. 
Biophys. Res. Comm., 126:1259), which may be employed in calcium-sensitive 
bioluminescence detection; a green fluorescent protein gene (Niedz et al, 1995 Plant 
Cell Reports, 14:403); a luciferase (luc) gene (Ow et aL, 1986, Science, 234:856), 
20 which allows for bioluminescence detection; a (^-lactamase gene (Sutcliffe, 1978, 
Proa Natl. Acad. ScL USA 75:3737), which encodes an enzyme for which various 
chromogenic substrates are known (e.g., PADAC, a chromogenic cephalosporin); an 
R-locus gene, encoding a product that regulates the production of anthocyanin 
; ; pigments (red color) in plant tissues (Dellaporta et al., 1988, in Chromosome 
25 Structure and Function, pp. 263-282); an a-amylase gene (Dcuta et aL, 1990, 
/ Biotech., 8:241); a tyrosinase gene (Katz et al, 1983, J. Gen. Microbiol, 129:2703) 
which encodes an enzyme capable of oxidizing tyrosine to dopa and dopaquinone 
i which in turn condenses to form the easily detectable compound melanin; or a xylE 
gene (Zukowsky et al, 1983, Proc Natl Acad. Sci. USA 80:1101), which eacodes a 
30 catechol dioxygenase that can convert chromogenic catechols. 
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5. Uses of the promoters of the invention 

The isolated nucleic acid promoters of the invention may be used, 
inter alia, to drive expression of a foreign or endogenous DNA sequence. Preferred 
agronomic properties encoded by the foreign or endogenous DNA sequence include, 
5 but are not limited to, insect resistance or tolerance, herbicide resistance or tolerance, 
disease resistance or tolerance (e.g y resistance to viruses or fungal pathogens), stress 
tolerance (increased salt tolerance) and improved food content or increased yields. 

The foreign or endogenous DNA sequence may comprise a region 
transcribed into a molecule that modulates the expression of a corresponding target 
10 gene. The molecule may be an antisense RNA or a ribozyme or other transcript 
aimed at downregulation of expression of the corresponding target gene. 

Anti-sense regulation and the use of ribozymes and co-suppression in 
plants are well known in the art. However, the skilled person is referred to United 
States Patent 5,759,829 for an example of antisense technology and to U.S. patent 
15 5,707,835, U.S. patent 5,747,335 and U.S. patent 5,840,874 which each provide 
examples of ribozyme technology. With regard to co-suppression, reference is made 
to U.S. patent 5,283,184, U.S. patent 5,686,649 arid International Publication 
WO98/53083 for examples of this technology. Each of these patent documents is 
incorporated herein by reference. 

20 Alternatively, the foreign or endogenous DNA sequence may encode 

a molecule which is readily detectable or measurable, e.g. P-glucuronidase or 
luciferase; a selectable product, eg., neomycin phosphotransferase (nptH) conferring 
resistance to aminogjycosidic antibiotics such as geneticin and paromomycin; a 
product conferring herbicide tolerance, e.g. glyphosate resistance or glufosinate 

25 resistance; a product affecting starch biosynthesis or modification e.g. starch 
branching enzyme, starch synthases, ADP-glucose pyrophosphorylase; a product 
involved in fatty acid biosynthesis, e.g. desaturase or hydroxylase; a product 
conferring insect resistance, e.g. crystal toxin protein of Bacillus thuringiensis; a 
product conferring viral resistance, e.g. viral coat protein; a product conferring 
30 fungal resistance, e.g. chitinase, (M,3-glucanase or phytoalexin; a product altering 
sucrose metabolism, e.g. invertase or sucrose synthase; a gene encoding valuable 
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pharmaceuticals, e.g. antibiotics, secondary metabolites, pharmaceutical peptides or 
vaccines. 

6. Introduction of isolated nucleic acids into plant cells 

A number of techniques are available for the introduction of DNA 
S into a plant host cell. There are many plant transformation techniques well known to 
workers in the art, and new techniques are continually becoming known. The 
particular choice of a transformation technology will be determined by its efficiency 
to transform certain plant species as well as the experience and preference of the 
person practising the invention with a particular methodology of choice. It will be 
10 apparent to the skilled parson that the particular choice of a transformation system to 
introduce a chimeric DNA construct into plant cells is not essential to or a limitation 
of the invention, provided it achieves an acceptable level of nucleic acid transfer. 
Guidance in the practical implementation of transformation systems for plant 
improvement is provided by Birch (1997, Annu. Rev. Plant Physiol. Plant Molec. 
1 5 Biol. 48: 297-326), which is incorporated herein by reference. 

In principle both dicotyledonous and monocotyledonous plants that 
are amenable to transformation, can be modified by introducing a chimeric DNA 
construct according to the invention into a recipient cell and growing a new plant 
that harbors and expresses the foreign or endogenous DNA sequence. 

20 Introduction and expression of foreign or chimeric DNA sequences in 

dicotyledonous (broadleafed) plants such as tobacco, potato and alfalfa has been 
shown to be possible using the T-DNA of the tumor-inducing (Ti) plasmid of 
Agrobacterium tumefaciens (See, for example, Umbeck, U.S. Patent No. 5,004,863, 
and International application PCT/US93/02480). A construct of the invention may be 

25 introduced into a plant cell utilizing A. tumefaciens containing the Ti plasmid. In 
using an A. tumefaciens culture as a transformation vehicle, it is most advantageous 
to use a non-oncogenic strain of the Agrobacterium as the vector carrier so that 
normal non-oncogenic differentiation of the transformed tissues is possible. It is 
preferred that the Agrobacterium harbors a binary Ti plasmid system. Such a binary 

30 system comprises (1) a first Ti plasmid having a virulence region essential for the 
introduction of transfer DNA (T-DNA) into plants, and (2) a chimeric plasmid. The 
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chimeric plasmid contains at least one border region of the T-DNA region of a wild- 
type Ti plasmid flanking the nucleic acid to be transferred. Binary Ti plasmid 
systems have been shown effective to transform plant cells as, for example, 
described by De Framond (1983, Biotechnology* 1:262) and Hoekema et al. (1983, 
5 Nature, 303:179). Such a binary system is preferred inter alia because it does not 
require integration into the Ti plasmid in Agrobacterium. 

Methods involving the use of Agrobacterium include, but are not 
limited to: (a) co-cultivation of Agrobacterium with cultured isolated protoplasts; (b) 
transformation of plant cells or tissues with Agrobacterium^ or (c) transformation of 
1 0 seeds, apices or meristems with Agrobacterium. 

Recently, rice, com, pineapple and sugarcane, which are monocots, 
have been shown to be susceptible to transformation by Agrobacterium, for example 
as described in United States Patent No. 6,037,522, International Publication 
W099/36637 and Arencibia et al. (1998, Transgenic Res. 7:213). However, some 
15 monocot crop plants have not yet been successfully transformed using 
Agrobacterium-mediated transformation. The Ti plasmid, however, may be 
manipulated in the future to act as a vector for these other monocot plants. 
Additionally, using the Ti plasmid as a model system, it may be possible to 
artificially construct transformation vectors for these plants. Ti plasmids might also 
20 be introduced into monocot plants by artificial methods such as microinjection, or 
fusion between monocot protoplasts and bacterial spheroplasts containing the T- 
region, which can then be integrated into the plant nuclear DNA. 

In addition, gene transfer can be accomplished by in situ 
transformation by Agrobacterium, as described by Bechtold et al (1993, C.R. Acad. 
25 Set Paris, 316:1194). This approach is based on the vacuum infiltration of a 
suspension of Agrobacterium cells. 

Alternatively, nucleic acids may be introduced using root-inducing 
(Ri) plasmids of Agrobacterium as vectors. 

Cauliflower mosaic virus (CaMV) may also be used as a vector for 
30 introducing of exogenous nucleic acids into plant cells (U.S. Pat. No. 4,407,956). 
CaMV DNA genome is inserted into a parent bacterial plasmid creating a 
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recombinant DNA molecule that can be propagated in bacteria. After cloning, the 
recombinant plasmid again may be cloned and further modified by introduction of 
the desired nucleic acid sequence. The modified viral portion of the recombinant 
plasmid is thai excised from the parent bacterial plasmid, and used to inoculate the 
5 plant cells or plants. 

Nucleic acids can also be introduced into plant cells by 
electroporation as, for example, described by Fromm et aL (1985, Proa Natl. Acad. 
ScL, U&A, 82:5824) and Shimamoto et at. (1989, Nature 338:274-276). In this 
technique, plant protoplasts are electroporated in the presence of vectors or nucleic 
1 0 acids containing the relevant nucleic acid sequences. Electrical impulses of high field 
strength reversibly permeabilise membranes allowing the introduction of nucleic 
acids. Electroporated plant protoplasts reform the cell wall, divide and form a plant 
callus. 

Another method for introducing nucleic acids into a plant cell is high 
15 velocity ballistic penetration by small particles (also known as particle bombardment 
or microprojectile bombardment) with the nucleic acid to be introduced contained 
either within the matrix of small beads or particles, or on the surface thereof as, for 
example described by Klein et aL (1987, Nature 327:70). Although typically only a 
single introduction of a new nucleic acid sequence is required, this method 
20 particularly provides for multiple introductions. 

Alternatively, nucleic acids can be introduced into a plant cell by 
contacting the plant cell using mechanical or chemical means. For example, a nucleic 
acid can be mechanically transferred by microinjection directly into plant cells by 
use of micropipettes. Alternatively, a nucleic acid may be transferred into the plant 
25 cell by using polyethylene glycol which forms a precipitation complex with genetic 
material that is taken up by the cell. 

Also contemplated are silicon carbide or tungsten whiskers, for 
example as described in United States Patent No. 5,302,523. 

There are a variety of methods known currently for transformation of 
30 monocotyledonous plants. Presently, preferred methods for transformation of 
monocots are microprojectile bombardment of explants or suspension cells, and 
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direct DNA uptake or electroporation as, for example, described by Shimamoto et aL 
(1989, supra). Transgenic maize plants have been obtained by introducing the 
' Streptomyces hygroscopicus bar gene into embryogenic cells of a maize suspension 
culture by microprojectile bombardment (Gordon-Kamm, 1990, Plant Cell, 2:603- 
5 ' 618). The introduction of genetic material into aleurone protoplasts of other - 
monocotyledonous crops such as wheat and barley has been reported (Lee, 1989, 
Plant Afol. BioL 13:21-30). Wheat plants have been regenerated from embryogenic : 
suspension culture by selecting only the aged compact and nodular embryogenic 
callus tissues for the establishment of the embryogenic suspension cultures (Vasil, 
10 1990, Bio/Technol 8:429-434). The combination with transformation systems for 
these crops enables the application of the present invention to monocots. These 
methods may also be applied for the transformation and regeneration of dicots. ' 
: Transgenic sugarcane plants have been regenerated from embryogenic callus as, for ' ' 
example, described by Bower et al. (1996, Molecular Breeding 2:239-249). 

15 Alternatively, a combination of different techniques may be employed 

to enhance the efficiency of the transformation process, e.g„, bombardment with 
Agrobacterium coated microparticles (EP-A-486234) or microprojectile 
bombardment to induce wounding followed by co-cultivation with Agrobacterium 
(EP-A-486233). 

20 The patent and scientific publications referred to above are all 

incorporated herein by reference. 

7. Production and characterisation of transgenic plants 

7.1 Regeneration 

The methods used to regenerate transformed cells into differentiated 
25 plants are not critical to this invention, and any method suitable for a target plant can 
be employed. Normally, a plant cell is regenerated to obtain a whole plant following 
* . a transformation process. 

Regeneration from protoplasts varies from species to species of 
plants, but generally a suspension of protoplasts is first made. In certain species, 
30 embryo formation can then be induced from the protoplast suspension, to the stage of 
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ripening and germination as natural embryos. The culture media will generally 
contain various amino acids and hormones, necessary for growth and regeneration. 
Examples of hormones utilized include auxins and cytokinins. It is sometimes 
advantageous to add glutamic acid and proline to the medium, especially for such 

5 species as com and alfalfa. Efficient regeneration will depend on the medium, on the 
genotype, and on the history of the culture. If these variables are controlled, 
regeneration is reproducible. Regeneration also occurs from plant callus, explants, 
organs or parts. Transformation can be performed in the context of organ or plant 
part regeneration as, for example, described in Methods in Enzymology, Vol. 118 and 
10 Klee et al (1987, Annual Review of Plant Physiology, 38:467), which are 
incorporated herein by reference. Utilizing the leaf disk-transformation-regeneration 
method of Horsch et al (1985, Science, 227:1229, incorporated herein by reference), 
^iisks are cultured on selective media, followed by shoot formation in about 2-4 
weeks. Shoots that develop are excised from calli and transplanted to appropriate 

15 root-inducing selective medium. Rooted plantlets are transplanted to soil as soon as 
possible after roots appear. The plantlets can be repotted as required, until reaching 
maturity. 

In vegetatively propagated crops, the mature transgenic plants are 
propagated by the taking of cuttings or by tissue culture techniques to produce 
20 multiple identical plants. Selection of desirable transgenotes is made and new 
varieties are obtained and propagated vegetatively for commercial use. 

In seed propagated crops, the mature transgenic plants can be self- 
crossed to produce a homozygous inbred plant. The inbred plant produces seed 
: containing the newly introduced foreign gene(s). These seeds can be grown to 
25 1 produce plants that would produce the selected phenotype, e.g. , early flowering. 

Parts obtained from the regenerated plant, such as flowers, seeds, 
1 leaves, branches, fruit, and the like are included in the invention, provided that these 
' parts comprise cells that have been transformed as described. Progeny and variants, 
and mutants of the regenerated plants are also included within the scope of the 
3d invention, provided that these parts comprise the introduced 
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It will be appreciated that the literature describes numerous techniques 
for regenerating specific plant types and more are continually becoming known. 
Those of ordinary skill in the art can refer to the literature for details and select 
suitable techniques without undue experimentation. 

5 7.2. Characterization 

To confirm the presence of the heterologous nucleic acid in the 
regenerating plants, a variety of assays may be performed. Such assays include, for 
example, "molecular biological" assays well known to those of skill in the art, such 
as Southern and Northern blotting and PGR; a protein expressed by the heterologous 
10 DNA may be analysed by western blotting, high performance liquid chromatography 
or ELISA nptH) as is well known in die art 

Examples of various methods applicable to characterization of 
. transgenic plants are provided in Chapters 9 and 1 1 of PLANT MOLECULAR 
BIOLOGY A Laboratory Manual EcL M.S. Clark (Springer-Verlag, Heidelberg, 
15 1997), which chapters are herein incorporated by reference. 

So that the invention may be understood and more detail, the skilled 
person is directed to the following non-limiting examples. 

EXAMPLE 1 
Construction of a tissue-specific cDNA library 

20 Extraction of tissue-specific RNA 

Total RNA was extracted essentially according to the method of 
Chirgwin et al (1979, Biochemistry 18 (24):5294-5299), from 9-month old Pindar 
sugarcane plants field-grown at the BSES Pathology Farm at Eight Mile Plains in 
Brisbane, Queensland, Australia. Roots, sterns and leaves were harvested from three 

25 separate plants randomly chosen in the field. The tissues were washed and stripped 
of dead material and immediately quick-frozen in liquid nitrogen. Five to ten grams 
of frozen tissue were ground to a fine powder in liquid nitrogen and 40 mL of a 4 M 
guanidine isocyanate solution were added. After thawing the mixture, 140 nL of P~ 
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mercaptoethanol and 2 mL of a 10% saikosyl solution were added and thoroughly 
mixed. The cell debris was pelleted by centrifugation. The supernatant was then 
filtered through one layer of Miracloth (Calbiochem) and gently poured over a 5.7 M 
caesium chloride cushion. The RNA was spun down for 18 hours in a SW27 rotor at 

5 20°C (25,000 RPM). The supernatant was carefully removed from the tubes and the 
RNA pellet was resuspended in 400 fiL of DEPC-treated distilled water. After a 
phenol/chloroform extraction (phenol/chloroform/isoamyl alcohol 24/24/2), the 
RNA was precipitated by addition of 1/10 volume of sodium citrate (3M, pH 5.2) 
and 2.5 volumes of ethanol. The quantity of extracted RNA was determined by 

10 spectrophotometry (OD260nm) and the quality visualised by agarose gel 
electrophoresis (Sambrook et aL 1989). 

Isolation of tissue-specific mRNA 

The PolyATtract® mRNA Isolation Systems (Promega) was used to 
separate the poly(A) RNAs from the non-poly(A) RNAs. The system uses a 
15 biotinylated oligo(dT) primer to hybridise to the poIy(A) tail of the mature mRNAs. 
The duplexes are captured by streptavidin coupled to a paramagnetic particle and 
isolated using a magnetic stand. The mRNAs are then eluted directly by DEPC- 
treated dH 2 0. The yields of mRNA from total RNA w?rc 0.37% (leaf), 0.31% (root) 
and 0.27% (stem). 

20 cDNA synthesis 

Stem mRNAs (5jig) were used in the synthesis of double-stranded 
cDNAs with the TimeSavei® cDNA Synthesis Kit (Pharmacia Biotech). Hie first- 
strand cDNA was synthesized using an oligo(dT)i2-i8 primer. After synthesis of the 
second strand, EcoBUNotl adaptors were ligated to the cDNAs. 

25 cDNA library 

A stem-specific cDNA library was constructed in Lambda ZAP®n 
(Predigested Lambda ZAP®n/£ctfRI/CIAP Cloning Kit, Stratagene). Stem cDNAs 
(150 ng) were ligated to 1 \ig of predigested Lambda ZAP®n vector for 16 hrs at 
14°C. Half of the ligation was in vfcro-packaged (Gigapack® II Gold Packaging 
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Extract, Stratagene). Titration of the library was carried out using 1/500 and 1/5000 
of the phage-containing supernatant and XL 1 -Blue MRF* host bacteria. After the 
phage and the host cells were incubated together, top agar containing IPTG (2.5 
mM) and X-gal (4 g/L) was added and the mixture plated on NZY solid medium 

5 - plates. Hie titer was estimated at 715,000 white plaque forming units/jig of vector 
with 4.6% non-recombinant background plaques (blue plaques). The second half of 
the ligations was similarly packaged. The packaged phages were combined, the titer 
was estimated at 663,000 pfu/mL after allowing a 20% drop of the titer after a 4-day 
storage at 4°C. The cDNA library was amplified by plating 50,000 pfu/ 150 mm- 

10 NZY plate (six plates). After 8hrs incubation at 37°C, when the plaques were no 
more than 1 to 2 mm in diameter, the plates were overlaid with 8-10 mL of SM 
buffer (10 mM NaCl, 8 mM MgS0 4 .7H 2 0, 50 mM Tris-HCl pH 7.5 and 0.01% 
gelatin) and gently rocked overnight at 4°C. The bacteriophage suspension was 
recovered from each plate and pooled. Chloroform was added (5% final 

15 concentration) and the cell debris pelleted by centrifugation (10 min at 2000xg). The 
supernatant was recovered and chloroform added to 0.3% final concentration^ 
Aliquots were stored in 7% DMSO at -80°C. The titer of the amplified library was 3 
xl0 ,0 pfu/mL. 

EXAMPLE 2 

20 Differential screening of the stem cDNA library, isolation of Lambda clones 

Platine of the cDNA library 

The stem cDNA library was plated with E.coli host strain LE392, at a 
density of 8,000 -pfii/1 50 mm NZY plates (supplemented with 0.2% maltose). Ten 
plates were incubated at 37°C for 9 hrs, when the plaques reached a size of 1 mm or 

25 less in diameter without being confluent Plaque growth was stopped by transferring 
the plates to 4°C. The phage plaques were transferred onto nylon membranes (Nylon 
N+, Amersham) in triplicates: the first lift was in contact with the plaques for 45 
seconds, the second lift for 2 min and the third lift for 4 min. The phage DNA was 
then denatured by placing the membrane for 2 min on a 3MM chromatography paper 

30 (Whatman) saturated with denaturing solution (2M NaOH, 1.5M NaCl). The 
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membrane was then transferred onto a 3MM chromatography paper saturated with a 
neutralising buffer (0.5 M Tris-HCl pH 8.0, 1.5 M NaCl) for 5min and finally rinsed 
in a 0.2 M Tris-HCl pH 8.0 and 2xSSC buffer (0.2 M NaCl, 0.03 M sodium citrate, 
pH 7.0) for lmin. The phage DNA was then UV-cross-linked to the damp 
5 membranes (GS Gene Linker™ UV Chamber, Bio-Rad). The membranes were air- 
dried and stored until further use between sheets of 3MM chromatography papers. 

Sinele-stranded radiolabelled cDNAs (vrobes) 

Labelling reactions (30 jiL total volume) contained 2 (ig of mRNAs 
used as templates, 1 jiL of Rnasin® (Promega), 6 jiL of 5x buffer (250 mM Tris-HCl 

10 pH 8.3, 375 mM KCL, 15 mM MgCl 2 ), 2 jiL of dNTPs solution (10 mM each of 
dATP, dTTP and dGTP), 2 jiL of oligo(dT) (0.5 ngfjiL), 5 *iL of 32 PdCTP 
(10*iCi/|iL, Amersham) and 1 jiL of M-MLV Reverse Transcriptase (Life 
Technologies). The mRNAs were brought to 13 jiL with DEPC-treated water and 
denatured for 10 min at 65°C prior to the addition of the other components. The 

15 reaction was carried out at 37°C for 1 hr. The probes were purified through a 
polyacrylamide column (Biospin 30 column, Bio-Rad). 

Differential hybridisation 

The nylon membranes were pre-hybridised for 6 hrs at 42°C in a pre- 
hybridisation solution containing 50% deionised formamide, 5xSSC (0.5 M NaCl, 

20 0.075 M sodium citrate, pH 7.0), 5x Denhardfs solution (0.1% ficoll, 0.1% 
polyvinylpyrollidone and 0.1% bovine serum albumin), 0.1% sodium dodecyi sulfate 
(SDS) and 200 mg/mL denatured salmon sperm DNA. The radiolabelled probe was 
added directly to the pre-hybridisation solution after completion of the pre- 
hybridisation procedure. Sets of membranes wore hybridised as follows: the first lifts 

25 were hybridised to the leaf radiolabelled cDNAs, the second lifts were hybridised to 
the root radiolabelled cDNAs and the third lifts ware hybridised to the stem 
radiolabelled cDNAs. 
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Washing of the membranes and exposure to a X-rav film 

After a 20 hr hybridisation, the membranes were washed sequentially - ' ;.' ; 
in 2xSSC and 0.1% SDS solution for 15 min at 45°C, 2xSSC and 0.1% SDS solution &\ 
for 15 min at 50°C, 0.2xSSC and 0.1% SDS solution for 12min at 55°C and 0.2xSSC /fe< 
5 and 0.1%SDS solution for 12 min at 65°C. The membranes were wrapped in a plastic )£yK 
film and exposed for two days to X-ray film (X-Omat Diagnostic Film, Kodak). / v Mvi j 

Primary screening . ; .. . . .. ,, M -^ ; 

Identification of differentially expressed cDNA clones (stem-specific) 

Each set of films corresponding to the same initial plate was visually 
10 examined for differentially hybridising plaques: for example, the film 1.1 ;^v}?;? 

(corresponding to the plate 1, first lift, probed with the leaf radiolabelled cDNAs) ! 

was overlaid with the film 1.3 (corresponding to the plate 1, third lift, probed with 

the stem radiolabelled cDNAs). The tea sets of films were screened for plaques ; 

hybridising to the stem radiolabelled cDNAs but giving no signals (or faint signals) 
15 when probed by either the leaf or root radiolabelled cDNAs. This method allowed 

the identification of 44 plaques corresponding to potentially stem-specific cDNA 

clones. 

Identification of constitutivelv expressed cDNA clones 

The same method revealed many plaques that strongly hybridised to 
20 each of the three different independently applied radiolabelled cDNAs stem, 
root and leaf). Seven plaques were selected amongst many that fit the requirements. 

Secondary screening 

Identification of differentially expressed cDNA clones (stem-specific) 

The plaques identified during the primary screening were cored from 
25 the plates and eluted in 1 mL of SM buffer and 20 jiL of chloroform. The titers were , 
determined for each eluate. A top agar bacterial lawn was poured onto 100 mm-NZY 
plates (supplemented with 0.2% maltose). About 50 phages from each eluate were 
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spread onto 1/4 of the plate. Plaques were grown at 37°C for 8 his, then placed at 
4°C. Triplicate lifts were made from each plate (Nylon N+, Amersham) as described 
above. Pre-hybridisation, hybridisation and visual screening, carried out in the same 
manner as for the primary screening, confirmed 37 plaques with a possible stem- 
5 specific expression pattern. 

Identification of constitutivetv expressed cDNA clones 

Secondary screening confirmed all of seven tested constitutive clones. 

In vivo excision of the oBluescrivt SKf-) ohaeemid f rom the Lambda ZAP 27 vector 

The excision of the phagemid from the Lambda ZAP II vector was 

10 performed as described in the manual of the Lambda ZAP® U/EcoRI/GAP Cloning 
Kit (Stratagene). The ExAssist/SOLR system procedure results in SOLR colonies 
containing the pBluescript SK(-) double-stranded phagemid with a cloned DNA 
insert. The plasmids were isolated and restricted by either EcoRL or Notl to release 
the cloned cDNA which was purified from an agarose gel and used to generate 

1 5 radiolabeled probes by random priming for Southern and northern analyses. 

EXAMPLE 3 

Northern and Southern analyses of the cDNA clones 

Northern analysis 

The specificity of the expression of each clone was verified by 
20 northern analysis. Total RNA was extracted from different tissues (i.e. roots, 
; j different stem internodes and nodes, young emerging leaves, mature leaves and older 
; leaves) of field-grown sugarcane and maize plants and glasshouse-grown sorghum 
plants as described in Example 1. The RNA sample was mixed with 6 volumes of 
5 RNA sample buffer (60% deionised formamide, 25% of 37% formaldehyde and 15% 
25 ' of a lOxMOPS buffer containing 0.2 M MOPS pH 7.0, 50 mM sodium citrate and 5 
^ • mM EDT A pH 8.0) and 1 volume of RNA loading solution (50% glycerol, ImM 
. . EDTA pH 8.0, 2 mg/mL bromophenol blue and 2 mg/mL xylene cyanol). Prior to 

j loading the gel, the samples were heat-denatured at 65°C for 10 min. Equal amounts 

■ i 
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of total RNA (8 jig/lane) were loaded onto an agarose gel. The gel was run in lxTBE 
buffer (0.1 M Tris, 0.09 M Boric acid and 1 mM EDTA). Transfer of the RNA to a 
nylon membrane (Nylon N+, Amersham) was by capillary blotting (Sambrook et al. 
1989). Pre-hybridisation and hybridisation were carried out as described before, 

5 using probes labelled with 32 PdCTP (Amersham Rediprime DNA labelling system). 
Blots were washed at the stringency previously described, and scanned with a 
Phosphorlmager SI (Molecular Dynamics). FIGS 1 and 2 show the expression 
pattern of the cDNA clone c67 (SEQ ID NO: 1) that is preferentially expressed in the 
stem of field-grown sugarcane plants. The relative expression levels are 100 in the 

10 stem, 0.3-0.7 in the leaves and between 1.7 and 5 in the roots (depending on the 
cultivar). Within the stem, c67 is predominantly expressed in die more mature parts 
of the stem. FIGS. 3, 4, 5 and 6 show the stem-specific expression pattern of the 
cDNA clone c51 (SEQ ID NO: 2, 3 and 4). The c51 probe also reveals a homologous 
stem-specific cDNA in maize and sorghum plants (with some expression in the roots 

IS of glasshouse-grown sorghum plants). Figures 7, 8, 9 and 10 show the constitutive 
expression of the cDNA clone c32A (SEQ ID NO: 5 and 6) within the different parts 
of the sugarcane, maize, sorghum and rice plants that have been tested. 

Southern analysis 

DNA was extracted from sugarcane cultivars Pindar or Q117, 

20 digested by selected restriction enzymes, and electrophoresed (10 ng/lane) on an 
agarose gel in IxTAE buffer (40 mM Tris-acetate, 1 mM EDTA, pH 8.3). The DNA 
was then denatured for Ihr (2 M NaOH, 1.5 M NaCl), neutralised for lhr (0.5 M 
Tris-HCl pH8.0, 1.5 M NaCl) and equilibrated in lOxSSC prior to the transfer of the 
DNA onto the nylon N+ membranes (Amersham) by capillary blotting. The 

25 membranes were then treated as for the northern blots for pre-hybridisation, 
hybridisation to a radiolabeled probe, washes and detection by Phosphorlmager 
(Molecular Dynamics). Southern blot analyses of the clones c67 (SEQ ID NO: I), 
c51 (SEQ ID NO:2, 3 and 4) and c322 (SEQ ID NO:5 and 6) are shown respectively 
in FIGS. 11, 12 and 13. 
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EXAMPLE 4 

Screening of the stem-specific cDNA library 

The stem-specific cDNA library was screened as described above, 
using c51 (SEQ ID NO: 2) and c32A (SEQ ID NO: 5) as probes. After the secondary 
5 screening, one additional clone corresponding to c32A and two additional clones, 
corresponding to c51, were isolated (c322, c511 and c512, corresponding 
respectively to SEQ ID NOS: 6, 3 and 4). 

EXAMPLE 5 

Sequencing of the clones 
10 Plasmid inserts were sequenced using universal primers (T3, T7, SP6, 

M13 Reverse and M13 Forward) or custom-designed primers, and the ABI 
PRISM™ Big Dye™ Terminator Cycle Sequencing Ready Reaction Kit was used 
and the reactions loaded onto an ABI PRISM instrument. Sequences were obtained 
from both strands of the DNA. 

15 The nucleotide sequence of the cDNA clone c67 (976nt) is shown in 

figure 14 (SEQ ID NO: 1). FIG. 15 shows the nucleotide sequence alignment of the 
homologous cDNA clones c51, c511 and c512 (SEQ ID NO: 2, 3 and 4). FIGS. 16 
and 17 show the nucleotide sequences of the cDNA clones c32A and c322 
respectively (SEQ ID NO: 5 and 6). FIGS. 23, 24, 26, 29 and 31 show the nucleotide 

20 sequences of cDNA clones cl9 (SEQ ID NO: 7), c3 (SEQ ID NO: 8), cl8 (SEQ ID 
NO: 9), c53A (SEQ ID NO: 10) and c57 (SEQ ID NO: 1 1), respectively. In addition, 
FIGS. 18, 19,20 and 21 show respectively the nucleotide sequences of the promoter 
sequences 67p (SEQ ID NO: 12), 32A2P2 (SEQ ID NO: 13), 32A6P2 (SEQ ID NO: 
14) and 51p (SEQID NO: 15). 
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EXAMPLE 6 

Promoter recovery by iPCR 

Primer design for inverse PCR reaction 

Two primers designed from the sequence of the cDNA clone c67 
5 (SEQ ED NO: 1) for use in an inverse PCR reaction are 67-1 (5' 
AGGACGCTTCTCAGATTGGC 3'; SEQ ID NO:24) and 67-2 (5 1 
GTATTCGGAGTTGCAGGTCG 3'; SEQ ID NO:25). 

iPCR 

Genomic DNA from sugarcane (cultivar Ql 1 7) was isolated through a 

10 CsCl gradient and restricted by SacL The enzyme was heat-inactivated (65°C for 10 
min) and 600 ng of restricted DNA were recircularised in a total reaction volume of 
400 \xL containing 9 Weiss units of T4 DNA ligase (New England Biolabs). After 
completion of the ligation, the reaction was extracted with phenol/chloroform and 
the DNA precipitated by addition of ethanol. The PCR reaction was carried out in a 

15 final volume of 50 jiL comprising 150 ng of circularised DNA, 10 jiL of 5x buffer 
(for a final concentration of 60 mM Tris-S0 4 pH9.1, 18 mM (NILt)2S04 and 1.8mM 
MgCU), 20 ng of primer 67-1, 20 ng of primer 67-2, 5 \iL of a dNTPs solution 
(2mM each) and 1.5 jiL of polymerase (ELONGASE™ Enzyme Mix, Life 
Technologies). After an initial 30 s at 94°C, 35 cycles were performed consisting of 

20 1 min at 94°C, 1 min at 58°C and 10 min at 68°C. A second iPCR reaction was 
performed in the same conditions using 0.1 jiL of the first reaction as template. 
Products of the iPCR reactions were run on an agarose gel and the DNA purified 
from the gel (Bresaclean, Bresatec). The purified DNA was blunt-ended using 
Klenow and restricted by Sad. The two DNA products were cloned into pGEM®-4Z 

25 (Promega) digested by Smal and Sacl. Sequencing using the T7 primer identified the 
plasmid clone containing a DNA region upstream of the primer 67-1 (plasmid pG4- 
67pro). 
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Fusion of the promoter rezion to a reporter gene 

The full nucleotide sequence (to the ATG start codon) of the promoter 
region obtained by the inverse PCR method is shown in FIG. 18 (SEQ ID NO: 12). 
The GUS gene and nos terminator were excised from plasmid pBI10l.3 (Clontech) 

5 as an Xbal-EcoRI fragment and ligated into the Xbal and EcoRl sites of pBluescript 
(Stratagene) to form plasmid pBS-GUS-3. A partial Sad digest, followed by a 
BamHL digest were carried out on the pBS-GUS-3 plasmid. Within these two sites 
was inserted a SacI-BamHL DNA fragment from the plasmid pG4-67pro to obtain 
plasmid p67G, with the GUS coding region translationally fused to the ATG start 

10 codon at the 3' end of the 67pro. 

EXAMPLE 7 

Promoter recovery by screening of a genomic library 

Pindar genomic library 

A Pindar sugarcane genomic library made in LambdaGEM®-l 1 
15 (Promega) was plated with E.coli host strain KW251 on 14 NZY plates (150 mm) at 
a density of 24,000 pfu/plate. Duplicate lifts ori nylon membranes (N+, Amersham), 
prehybridisation and hybridisation were carried out as for the screening of the cDNA 
library. 

The probe was made from cDNA c322 (SEQ ID NO: 6) excised from 
20 the plasmid by an Ec6Rl-EcoOl09I digest and randomly radiolabeled (Rediprime 
DNA labelling system, Amersham). Two genomic clones were isolated after a 
secondary screening: X32A2 and X32A6. The Lambda DNA was isolated from liquid 
lysates (Ausubel et al 1990), restricted by different enzymes (Psfl, EcoRV, EcoRL, 
/findffl, NcoU BstXl and EcoOl09T) and the fragments were analysed by Southern 
25 blotting. From both Lambda clones, a PstI fragment containing the promoter region 
was cloned into pZErO-2 (Invitrogen). Sequences from the resulting plasmids (pZ- 
32A2P2 and pZ-32A6P2) are shown in FIGS. 19 and 20 (SEQ ID NO: 13 and 14). 
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H32-8560 genomic library 

A sugarcane genomic library (Albert et ah 1992, Plant Mol. Biol. 
20:663-671) made in Lambda EMBL4 (Stratagene) was plated with E.coli host strain 
Y1090r- on ten 150 mm-plates at a density of 35,000 pfu/plate. Duplicate lifts on 
5 nylon membranes (N+, Amersham), prehybridisation and hybridisation were carried 
out as for the screening of the cDNA library. A Nhel DNA fragment from the cDNA 
c51 (SEQ ID NO: 2) was used to generate a radiolabelled probe (Rediprime DNA 
labelling system, Amersham). One clone was isolated after tertiary screening (AH51- 
6). The DNA fragments from an EcoRI and a PstI restriction digest corresponding to 
10 hybridising signals from die clone XH51-6 were cloned into the pZErO™-2 cloning 
vector (Invitrogen). These clones were designated pZ-H51-6E (EcoRI) and pZ-H51- 
6P1 (PstI). The pZ-H51-6E insert is 1052 bp long and contains the ATG start codon 
and 960 bp of sequence upstream of the ATG. The pZ-H51-6Pl insert is 3.3 kb long 
and contains 670 bp of sequence upstream of the ATG. The 240 bp EcoKl-PstI 
15 fragment from pZ-H51-6E was used to re-probe the Southern blot of the AH51-6 
restriction digests. A PstI DNA fragment hybridising to this probe was cloned into 
pZErO (pZ-H51-6P2). The sequence of pZ-H51-6P2 corresponds to the sequence 
immediately upstream of the sequence from pZ-H51-6Pl. The sequence of the 
promoter region of the AH51-6 clone is shown in FIG. 21 (SEQ ID NO: 15). 

20 Fusion of the promoter reeion to a reporter gene 
32A6 

A primer was designed for amplification of the promoter region from 
pZ-32A6P2 in conjunction with the M13-Reverse primer (5* 
CAGGAAACAGCTATGAC 3'; SEQ ID NO:26). This primer sequence (primer 

25 32A4) is homologous to the region surrounding the ATG and contains a BamHl 
restriction site at its 5' end (in bold). The sequence is 5' 
TAGGATCCTACCATCTTGAGATGCGG 3' (SEQ ID NO:27). The PCR reaction 
was carried out with a proofreading DNA polymerase from the ELONGASE 
Enzyme Mix (Life Technologies). The PCR product was digested by PstI and 

30 BamHl and purified. The GUS gene and nos terminator were excised from plasmid 



WO 01/18211 



PCT/AU00/01033 



59 

pBIlOl.l (Clontech) as a Xbal-EcoBI fragment and ligated into the Xbal and EcoRI 
sites of the pGEM®-4Z (Promega) to fonn plasmid pG4-GUS-l. This plasmid was 
restricted by BamHl and Pstl and the PGR product ligated into these two sites to 
fonn a translational fusion of the 32A6 promoter to the GUS reporter gene. This 
5 plasmid construct was designated p32A6G and its fusion region confirmed by 
sequencing with the primer Gus-2 (5' CGCTGATCAATTCCACAG 3'; SEQ ID 
NO:28). 

32A2 

The primer 32A4 was also used to amplify the promoter region of pZ- 
10 32A2P2 in conjunction with the M13-Reverse primer. The PCR reaction was carried 
out with a proofreading DNA polymerase from the ELONGASE Enzyme Mix (Life 
Technologies). The PCR product was restricted by BamHl and PsiU purified and 
ligated into the BamHl and Pstl sites of pG4-GUS-l to form p32A2G. The fusion 
region was confirmed by sequencing p32A2G with the primer Gus-2. 

15 57 

The 51 promoter region is present in two plasmids (pZ-H51-6Pl and 
pZ-H51-6P2). Fusion of the two parts of the promoter was carried out as follows: 
pZ-H51-6Pl was partially digested with PstL The full-length linear plasmid was gel 
purified. The far upstream promoter region in pZ-H51-6P2 was cut from the plasmid 
20 pZ-H51-6P2 as a Pstl fragment and ligated into the Pstl site of pZ~H51-6Pl. The 
resulting plasmid was designated pZ-H51-6PlP2. 

The GUS gene and nos terminator were excised from plasmid 
pBI101.3 (Clontech) as aXbal-EcoRI fragment and ligated into the Xbal-EcdRI sites 
of the pGEM®-4Z (Promega) to form plasmid pG4-GUS-3. Two translational 
25 fusions of the 51 promoter were made in pG4-GUS-3. An oligonucleotide primer 
(51G) homologous to the region surrounding the ATG start codon was designed to 
contain a BamHl restriction site (in bold) in its 5' end. The sequence of 51G is 5 1 
ATGGATCCCCATTGGTGACGATCAGAAG 3' (SEQ ED NO:29). Using 51G and 
M13-Reverse primers, the promoter region was amplified from pZ-H51-6E with a 
30 proofreading DNA polymerase from the ELONGASE Enzyme Mix (Life 
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Technologies). The PCR product was then restricted with BamHL and ligated into the 
BamHL site of pG4-GUS3, generating the plasmid pSISG (959bp of promoter region 
to the ATG). The plasmid pSISG was restricted by Ncol and Sail and a Sail - Ncol 
fragment from the plasmid pZ-H51-6PlP2 was inserted, resulting in the plasmid 
5 p5 1LG which contains 2133 bp of promoter region (to the ATG). 

EXAMPLE 8 

Functionality of the promoter: sugarcane transformation 

Particle bombardment 

Plasmids (GUS constructs) were isolated by rapid alkaline extraction, 

10 dissolved in TE buffer and checked for quality (intactness and absence of genomic 
DNA and RNA contamination) by. gel electrophoresis, quantified by 
spectrophotometry (Sambrook et al 1989), and not linearised before precipitation' 
onto tungsten particles. Tungsten (Bio-Rad M10) was purchased in small quantities, 
sterilised by washing in ethanol and sterile waiter, and stored in water at -20°C. 

15 Tungsten stored for several years as a dry powder may become unsuitable for gene 
transfer. 

Precipitation reactions were performed by adding the following 
reagents at 4°C in the listed order to a 1.5 mL microfiige tube: 5 jiL selectable 
marker plasmid DNA (1 mg/mL pEmuKN unless specified), 5 \lL non-selected 

20 plasmid DNA (1 mg/mL unless specified), 50 jiL tungsten (M10, 100 mg/mL 
thoroughly suspended in water), 50 fiL CaCl 2 (2.5 M in water), 20 jiL spermidine 
(100 mM free base in water). The preparation was mixed immediately after addition 
of each reagent, with minimal delay between addition of CaCh and spermidine. The 
tungsten was then allowed to settle for 5 min on ice, before removal of 100 jiL of 

25 supernatant and resuspension of the tungsten by running the tube base across a tube 
rack. Suspensions were used within 15 min, at a load of 4 jiL/bombardment, with 
resuspension of particles immediately before removal of each aliquot Assuming the 
entire DNA is precipitated during the reaction, this is equivalent to 1.3 \ig 
DNA/bombardment, on 667 ng tungsten/bombardment. 



WO 01/18211 PCT/AU00/01033 

61 

Particles were accelerated by direct entrainment in a helium gas pulse, « 
through die constriction of a syringe filter holder into target tissues in a vacuum 
chamber, as described previously (Bower et al 1 996, supra). 

Target tissues, selection and plant regeneration 

5 Embryogenic callus cultures of sugarcane cultivar Ql 1 7 were initiated 

and bombarded, with osmotic treatment, followed by escape-free selection for 
resistance to 45 mg/L G418, and regeneration to plants as described previously 
(Bower et al. 1996, supra). 

Potted plants were grown at 28 ± 2°C in a containment glasshouse 
10 without artificial lighting. 

EXAMPLE 9 

Analysis of transgenic sugarcane plants 

Southern analysis 

Regenerated plants were confirmed transgenic by Southern analysis, 
15 using the GUS coding region as a probe. 

Histochemical GUS analysis 

Histochemical detection of GUS activity was done using the substrate 
5-bromo-4-chloro-3-indolyl glucuronide (X-Gluc; Jefferson 1987, Plant Molecular 
Biology Reporter 5:387-405). Sugarcane tissues (leaf segments, roots and stem 

20 slices) were immersed in assay buffer (0.05% X-Gluc, 50 mM sodium phosphate pH 
7.0), vacuum infiltrated twice for lOmin and incubated for 16hrs at 37°C. Following 
this incubation, green tissues were destained with ethanol and blue-stained tissues 
identified. FIG. 22 shows the GUS activity in the stem of three transgenic plants 
transformed with plasmid p67G. This is the first demonstrated recovery of a 

25 functional promoter from sugarcane and retention of the promoter activity and 
specificity when re-introduced into sugarcane, which has so far shown a marked 
propensity to silence transgenes. 
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The present invention has been described 
embodiments found or proposed by the present 

modes for the practice of the invention. Those of skill in the art will appreciate that, 
in light of the present disclosure, numerous modifications and changes may be made 
in 
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CLAIMS 

>/: — 

1. An isolated nucleic acid comprising a nucleotide sequence which 
corresponds to a promoter region of a transcribable DNA sequence that is 
hybridizable to a probe or primer derivable from a polynucleotide sequence selected 
5 from the group consisting of: - 

(a) the polynucleotide sequence set forth in FIG. 14 [SEQ ID NO: I]; 

(b) the polynucleotide sequence set forth in FIG. 1 5 under designator cSl 
[SEQ ID NO: 2]; 

(c) the polynucleotide sequence set forth in FIG. 15 under designator 
10 c511 [SEQIDNO:3]; 

(d) the polynucleotide sequence set forth in FIG. 15 under designator 
c512 [SEQ ID NO: 4]; 

(e) the polynucleotide sequence set forth in FIG. 16 [SEQ ID NO: 5]; 
(£) the polynucleotide sequence set forth in FIG. 17 [SEQ ED NO: 6]; 

1 5 (g) the polynucleotide sequence set forth in FIG. 23 [SEQ ID NO: 7]; 

(h) the polynucleotide sequence set forth in FIG. 24 [SEQ ID NO: 8]; 

(i) the polynucleotide sequence set forth in FIG. 26 [SEQ ID NO: 9]; 

(j) the polynucleotide sequence set forth in FIG. 29 [SEQ ID NO: 10]; 
and 

20 (k) the polynucleotide sequence set forth in FIG. 31 [SEQ ID NO: 11]. 

2. An isolated nucleic acid comprising a nucleotide sequence which 
corresponds to a promoter region of a transcribable DNA sequence selected from the 
group consisting of: - 

(a) the polynucleotide sequence set forth in FIG. 14 [SEQ ID NO: 1]; 
25 : (b) the polynucleotide sequence set forth in FIG. 15 under designator c51 

[SEQ ID NO: 2]; 

(c) the polynucleotide sequence set forth in FIG. 15 under designator 
c511 [SEQ ID NO: 3]; 

(d) the polynucleotide sequence set forth in FIG. 15 under designator 
30: c512[SEQIDNO:4]; 

i : (e) the polynucleotide sequence set forth in FIG. 16 [SEQ ID NO: 5]; 

! : (f) the polynucleotide sequence set forth in FIG. 17 [SEQ ID NO: 6]; 
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(g) the polynucleotide sequence set forth in FIG. 23 [SEQ ID NO: 7]; 

(h) the polynucleotide sequence set forth in FIG. 24 [SEQ ID NO: 8]; 

(i) the polynucleotide sequence set forth in FIG. 26 [SEQ ID NO: 9]; 

(j) the polynucleotide sequence set forth in FIG. 29 [SEQ ID NO: 10]; 
5 and 

(k) the polynucleotide sequence set forth in FIG. 31 [SEQ ID NO: 1 1]. 

3. The isolated nucleic acid of Claim 1, wherein the nucleotide sequence is of a 
length in the range 100 bp to 4 kb. 

4. The isolated nucleic acid of Claim 3, wherein the nucleotide sequence is of a 
10 length in the range 1 kb to 4kJ>. 

5. The isolated nucleic acid of Claim 1, wherein the isolated nucleic acid is 
capable of directing transcription in many or all tissues of a plant 

6. The isolated nucleic acid of Claim 1, wherein the isolated nucleic acid is 
capable of directing transcription preferentially in stem tissue of a plant 

15 7. The isolated nucleic acid of Claim 1, wherein the isolated nucleic acid is 
capable of directing transcription preferentially in meristem tissue of a plant. 

8. The isolated nucleic acid of any one of Claims 5-7, wherein the plant is a 
monocotyledonous plant 

9. The isolated nucleic acid of Claim 8, wherein the monocotyledonous plant is 
20 sugarcane. 

10. An isolated nucleic acid which comprises a nucleotide sequence selected 
from the group consisting of: SEQ ID NO: 12 , SEQ ID NO:13, SEQ ID NO: 14 and 
SEQIDNO:15. 

11. An isolated nucleic acid which comprises a biologically-active fragment of 
25 any one of SEQ ID NOS: 12, 13, 14 or 15. 

12. An isolated nucleic acid which has at least 60% sequence identity with any 
one of SEQ ID NOS: 12, 13, 14 or 15. 

13. An isolated nucleic acid which is capable of hybridising to any one SEQ ID 
NO: 12, 13, 14 or 15 under at least low stringency conditions. 

30 14. A chimeric gene comprising the isolated nucleic acid of any one of Claims 1 , 
2, 10, 1 1, 12 or 13 operably linked to a heterologous nucleic acid. 
15. The chimeric gene of Claim 14, characterized in that said isolated nucleic 
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acid is capable of directing transcription of the heterologous nucleic acid in many or 
all tissues of a plant 

16. The chimeric gene of Claim 14, characterized in that said isolated nucleic 
acid is capable of directing transcription of the heterologous nucleic acid 

5 preferentially in stem tissue of a plant. 

17. The chimeric gene of Claim 14, characterized in that said isolated nucleic 
acid is capable of directing transcription of the heterologous nucleic acid 
preferentially in meristem tissue of a plant 

18. An expression vector comprising the isolated nucleic acid of any one of 
10 Claims 1,2, 10, 11, 12 or 13. 

19. The expression vector of Claim 14, wherein the isolated nucleic acid 
comprises a nucleotide sequence selected from the group consisting of SEQ ID NOS: 
12, 13, 14 or 15. 

20. The expression vector of Claim 18 or 19, further comprising a heterologous 
1 S nucleic acid operably linked to said isolated nucleic acid. 

21. The expression vector of Claim 20, wherein the heterologous nucleic acid 
encodes a molecule which inhibits the expression of a target gene. 

22. The expression vector of Claim 21, wherein the molecule is an antisense 
RNA. 

20 23. The expression vector of Claim 21, wherein the molecule is a ribozyme. 

24. The expression vector of Claim 21, wherein the molecule is capable of 
inhibiting expression of the target gene by co-suppression. 

25. The expression vector of Claim 20, wherein the heterologous nucleic acid 
encodes a polypeptide. 

25 26. A method of transforming a plant cell or tissue, comprising the step of 
introducing into said plant cell or tissue the isolated nucleic acid of any one of 
Claims 1,2, 10, 11, 12 or 13. 

27. A transformed plant cell or tissue comprising the isolated nucleic acid of any 
one of Claims 1, 2, 10, 1 1, 12 or 13. 
30 28. A transgenic plant comprising the isolated nucleic acid of any one of Claims 
1,2,10, 11, 12 or 13. 

29. The transgenic plant according to Claim 28, wherein the transgenic plant has 
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an altered phenotype. 

30. A transgenic monocotyledonous plant according to Claim 29. 

31. A transgenic sugarcane plant according to Claim 30. 

32. A progeny plant produced from the transgenic plant of Claim 28 

5 33. A cell, tissue or seed obtained from the plant of any one of Claims 28-31. 

34. Plasmid pG4-67pro deposited with AGAL on August 30, 1999 under 
accession number NM99/05995. 

35. Plasmid pZ-32A2P2 deposited with AGAL on August 30, 1999 under 
accession number NM99/05994. 

10 36. Plasmid pZ-32A6P2 deposited with AGAL on August 30, 1999 under 
accession number NM99/05993. 

37. Plasmid pZ-H51-6PlP2 deposited with AGAL on August 30, 1999 under 
accession number NM99/05992. 

38. An isolated nucleic acid comprising a polynucleotide sequence selected from 
15 the group consisting of: - 

(a) the polynucleotide sequence set forth in FIG. 14 [SEQ ID NO: 1]; 

(b) the polynucleotide sequence set forth in FIG. 15 under designator c51 
[SEQ ID NO: 2]; 

(c) the polynucleotide sequence set forth in FIG. 15 under designator 
20 c511[SEQIDNO:3]; 

(d) the polynucleotide sequence set forth in FIG. 15 under designator 
c512[SEQIDNO:4]; 

(e) the polynucleotide sequence set forth in FIG. 16 [SEQ ID NO: 5]; 

(f) the polynucleotide sequence set forth in FIG. 17 [SEQ ID NO: 6]; 
25 (g) the polynucleotide sequence set forth in FIG. 23 [SEQ ID NO: 7]; 

(h) the polynucleotide sequence set forth in FIG. 24 [SEQ ID NO: 8]; 

(i) the polynucleotide sequence set forth in FIG. 26 [SEQ ID NO: 9]; 

(j) the polynucleotide sequence set forth in FIG. 29 [SEQ ID NO: 10]; 
and 

30 (k) the polynucleotide sequence set forth in FIG. 31 [SEQ ID NO: 1 1]. 

39. A polypeptide encoded by an open reading frame of the isolated nucleic acid 
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according to Claim 38. 

40. The polypeptide of Claim 39, wherein the polypeptide comprises an amino 
acid sequence selected from the group consisting of: 

(i) the amino acid sequence set forth in FIG. 14 [SEQ ID NO: 16]; 
5 (ii) the amino acid sequence set forth in FIG. IS [SEQ ID NO:17]; 

(iii) the amino acid sequence set forth in FIG. 26 [SEQ ID NO: 18]; 

(iv) the amino acid sequence set forth in FIG. 29 [SEQ ID NO: 19]; 

(v) the amino acid sequence set forth in FIG. 3 1 [SEQ ID NO:20]; 

(vi) the amino acid sequence set forth in FIG. 24 [SEQ ID NO:21]; 

10 (vii) the amino acid sequence set forth in FIG. 16 [SEQ ID NO:22]; and 

(viii) the amino acid sequence set forth in FIG. 17 [SEQ ID NO:23]. 



15 

DATED this first day of September, 2000 

THE UNIVERSITY OF QUEENSLAND 
By their Patent Attorneys 
FISHER ADAMS KELLY 
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Fig. 10 
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Fig. 11 





WO 01/18211 



12/36 



PCT/AU00/01033 



Fig. 12 




WO 01/18211 PCT/AU00/01033 

14/36 = 



Fig. 14 .. Jfi'.-.'cSX- 



1 


TGAAATAAAC 


GGTAGCTGCC ATAACTAGTA 


CAATGGCCAA 


TCTGAGAAGC 


,50 , 




";'7«'v. v • 


■ s. » 








M A N 


L R S 




;• ; f ' : . S 

'-■If ; 


» • v'j V 
»'* 


51 


GTCCTAGCTG 


TGAGCCTAGC CGTGGCACTT 


TTCGCAGTTG 


CTCCTGCATC 


100 




f " 1 • f 




V L A V 


S LAVAL 


F A V A 


(P A S 


. - » "• 






Vf* >^ 


101 


GTTCGCACTG 


GATGAGAAAG AGTTGCACCT 


GAGTTTGTAC 


TTAAACCAGA 
LifN Q T 


150 




, 






F A L 


D E K E L H L \ 


S L .Y 






• • i •> - •■ 




151 


CATACAGCGG AAACGGCCTT AACCAGGCGG 


TGGTGGTCGA ACCAGGCCTA 


200 




• -• *».'•.' .= 






Y S G 


N G L N Q A V 


V V -E 


P G L 










201 


CCTGGGGAGT 


TCGGCAACAT CGCCGTCCAG 


GACTGGCCTG 


TGACCAATGG 


250 










P G E F 


G N I A V Q 


D W P V 


T N G 




, • . 






251 


GGAAGGTAGC 


GACGCAACCG TCGTTGGACG 


TGCACAGGGC 


ATCCAGTTCA 


300 


> 








E G S 


D A T V V G R 


A Q G 


i$:Q f.v.- k 






A 


301 


AACGAAGCGA 


GAGGAACGAC CAAGCCTGGT 


ATACCACCTT 


GACCATAGTG 


350 










P S Q 


R N D Q A W Y 


T T L 


T I V 




. ■■ ; » V/ 




351 


TTCGAGAACA 


CGAGCCTCAA GGGATCCACG 


CTTCAGATGA TGGGTTACAT 


400 








F E N T 


S L K G S T v 


L Q M M 


G Y I 








.-■::: ; .f 


401 


CCCACAAGAT 


GGTCAGTGGA GCATTTTTGG AGGAACTGGA CAACTTACGA 


450 










P Q D 


G Q W S I F G . 


G T G 


Q L T M 










451 


TGGCACGCGG 
A R G 


TGTTGTGAAC CACAAGGTTG 
V V N H K V V 


TGCGCCAAAC 
R Q T 


CAATGGCGGG 
N G G 


500 








501 


AGGATGTATA 
R M Y K 


AGATCAACAT ACATGCCTTC 
INI H A F 


TATACCCCCC 
Y T P L 


TGGGCGCTTG 
G A S 


550 




>V-;CV/:/:- 




551 


TAGCAACTGT 


GGGATTAACC TTAGGCGCTT 


GGACTTCGAC 


GCTTGATCGA 


600 


; * I - > 








S N C 


G I N L R R L 


D F D 


A * 










601 


CTAGCGCGGA 


CTACAACAGG AGGACCGTGT 


TCTTCGTCGA 


CGCTTAATGC 


650 








651 


ATGGAAACTT 


CCCACGGGGG ACCGTGTTCT 


TCATGGACGC 


TAGACCAACC 


700 




. » • ! " V- "r " 1 . 


. I 


70i 


ATAATTTCTT 


TTCCGTTTGT ACTGTCAACA AATATAAATA TGTAAAGCAT 


750 




■- ",'*.**. * 




751 


AAATCCGAAC 


TGTATTCGGA GTTGCAGGTC 


GTCGTTGCCC 


CTGCCTTATG 


800 




• , - 7 ' : 




801 


GCCTGCACTG 


TACATGTACA TGTTTCTGTC 


AAGTTCTGCG 


AGTATTTTAA 


850 






851 


GTAATAAATA 


AAGTGGTTGG TTTCACGGTT 


TAAAAAAAAA 


AAAAAAAAAA 


'900 








901 


AAAAAAAAAA 


AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 


AAAAAAAAAA 


, 950 






- » V j . 


951 


AAAAAAAAAA 


AAAAAAAAAA AAAAAA 976 








'. , \ \ \ si . 





WO 01/18211 PCT/AU00/01033 

15/36 



Fig. 15 



1 50 
c51 TCGAATGCTC GATCGATCCC ACTCTCAGCT GATCGCTCAC TCTTGCAGCT 

C511 TGCTC GATCGATCCC ACTCTCAGCT GATCGCTCAC TCTTGCAGCT 

c512 TCTC GATCGATCCC ACTCTCAGCT GATCGCTCAC TCTTGCAGCT 

51 100 
C51 CGATCAGTCT TAGCTCTAGC CTCTAGCTAG CCAACTAGCC GCTCCTTCGT 
c511 CGATCAGTCT TAGCTCTAGC CTCTAGCTAG CCAACTAGCC ACTCCTTCGT 
c512 CGATCAGTCT TAGCTCTAGC CTCTAGCTAG CCAACTAGCC ACTCCTTCGT 

101 150 
C51 GTAGCCATCA GCCTTCTGAT CGTCACCAAT GGCCACTGCC GAGGTCCAGA 
c511 GTAGCCATCA GCCTTCTGAT CGTCACCAAT GGCCACTGCC GAGGTCCAGA 
c512 GTAGCCATCA GCCTTCTGAT CGTCACCAAT GGCCACTGCC GAGGTCCAGA 
aa MATAEVQT 

151 200 
c51 CCCCGACCGT CGTGGCGACC GAGGAGGCGC CCGTGGTGGA GACGCCGGCG 
c511 CCCCGACCGT CGTGGCGACC GAGGAGGCGC CCGTGGTGGA GACGCCGGCG 
C512 CCCCGACCGT CGTGGCGACC GAGGAGGCGC CCGTGGTGGA GACGCCGGCG 
aa PTV VAT EEAP V V E T P A 

201 250 
C51 CCGGCCGTCG TGCCCGAGGA GGCTGCCCCC GCCCCCGCCG AGGCTGAGAC 
C511 CCGGCCGTCG TGCCCGAGGA GGCTGCCCCC GCCCCCGCCG AGGCTGAGCC 
c512 CCGGCCGTCG TGCCCGAGGA GGCTGCCCCC GCCCCCGCCG AGGCTGAGCC 
aa PAVV PEE A A P APAE AEP 



251 300 
C51 GGCGGCCGTG CCCGAGGAGG CTGCCCCCGC CGAGGCCAAG GTGGAGGAGC 
c511 GGCGGCCGTG CCCGAGGAGG CTGCCCCCGC CGAGGCCAAG GTGGAGGAGC 
c512 GGCGGCCGTG CCCGAGGAGG CTGCCCCCGC CGAGGCCAAG GTGGAGGAGC 
aa AAV P EEA A PA EAK VEEP 

301 350 
C51 CTGCCGCCCC GGCGGAGCCT GAGCCTGCCG CCGCTGAGCC CGAGGCCGAG 
c511 CTGCCGCCCC GGCGGAGCCT GAACCTGCCG CCGCTGAGCC CGAGGCCGAG 
c512 CTGCCGCCCC GGCGGAGCCT GAGCCTGCCG CCGCTGAGCC CGAGGCCGAG 
aa A A P AEP EPAA AEP EAE 



351 

c51 CCTGCCGCCG CGGAGCCGGA 
c511 CCTGCCGCCG CGGAGCCGGA 
c512 CCTGCCGCCG CGGAGCCGGA 
aa P A A A . E P E 



400 

GGCGGCCCCT GCAGCCGCGG CGGAGGAAGA 
GGCGGCCCCT GCAGCCGCGG CGGAGGAAGA 
GGCGGCCCCT GCGGCCGCGG CGGAGGAAGA 
A A P A A A A EEE 



401 450 
C51 GGCGCCAAAG GAGGCGGAGC CGGCGGCGGT TGAGGAGGTG AAGGAGGAGG 
C511 GGCGCCAAAG GAGGCGGAGC CGGCGGCGGT TGAGTAGGTG AAGGAGGAGG 
C512 GGTGCCAAAG GAGGCGGAGC CGGCGGCGGT TGAGGAGGTG AAGGAGGAGG 
aa Atv»P K EAEP AAV EEV K E E E 

451 . 500 

c51 AGGCGGCGGC GCCCGCTGCC GAGACAGAGC CGGCGGCCGC CGAGCCCGAG 
C511 AGGCGGCGGC GCCCGCTGCC GAGACAGAGC CGGCGGCCGC CGAGCCCGAG 
C512 AGGCGGCGGC GCCCGCTGCC GAGACAGAGC CGGCGGCCGC CGAGCCCGAG 
aa AAA P A A ETEP AAA EPE 
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Fig. 15 cont'd 



501 550 
C51 GTTGCTGCTC CTGCTGCCTC CGAGGAGCCC ACCGCGGCCG AGCCGGCCGC 
c511 GCTGCTGCTC CTGCTTCCTC CGAGGAACCC GCCGCGGCCG AGCCGGCCGC 
C512 GCTGCTGCTC CTGCTGCCTC CGAGGAGCCC ACCGCGGCCG AGCCGGCCGC 
aa V ( A)A AP A A S EEP T AAE P A A 

551 600 
C51 CGCGGAGCCC GAGAAGGCCA GCGAGTGAGG CCGTGCGCGC GCGCAGCGGC 
c511 CGCGGAGCCC GAGAAGGCCA GCGAGTGAGG CCGT. .GCGC GCGCAGCGGC 
C 512 CGCGGAGCCC GAGAAGGCCA GCGAGTGAGG CCGTGCGCGC GCGCAGCGGC 
aa A E P EKAS E * 

601 65Q 
c51 GGCGGCCAGG GGATCGGAGT GAGATGGCTC ATCGTGCGCG CGTGACGTAG 
C511 GGCGGCCAGG GGATCGGAGT GAGGTGGCTC ATCGTGCGCG CGTGACGTAG 
C512 GGCGGCCAGG GGATCGGAGT GAGATGGCTC ATCGTGCGCG CGTGACGTAG 

651 700 
C51 CGGCGTGGCA GTCGCGTGCG CGCGTACATG GCACGGTCTT TTGCTGTTCA 
C511 CGGCGTGGCA GTCGCGTTTG CGCGTACATG GCACGGTCTT TTGCTGTTCA 
C512 CGGCGTGGCA GTCGCGTGCG CGCGTACATG GCACGGTCTT TTGCTGTTCA 

701 750 
C51 GTACGCTACG GTTACTGGCG TTTC . . TGTG TGTGTGTCGT AGCTCGGTAC 
c511 GTACGCTACG GTTACTGGCG TTTCTGTGTG TGTGTGTCGT AGCTCGGTAC 
c512 GTACGCTACG GTTACTGGCG TTTC . . TGTG TGTGTGTCGT AGCTCGGTAC 

751 800 
c51 GTAGCTAGCA GTGGCTGTCG CGTGGACTGT CCGGTGTGGA CTGCGAGTGA 
C511 GTAGCTAGCA GTGGCTGTCG CGTGGACTGT CCGGTGAGGA CTGCGAGTGA 
C512 GTAGCTAGCA GTGGCTGTCG CGTGGACTGT CCGGTGTGGA CTGCGAGTGA 

801 850 
c51 CGTGGGCTAA ATAAAGTGTG CGTCGTGTGG TGG ... TGGT GGTGGGGCCC 
C511 CGTGGGCTAA ATAAAGTGTG CGTGGTGGTG GGGCCCGGTG TCAGTGGCGC 
C512 CGTGGGCTAA ATAAAGTGTG CGTCGTGTGG TGG. . .TGGT GGTGGGGCCC 

851 900 
C51 GGTGCCCCGT GCCCTGCCGC TGCAGCTAGT CTGTCTTTT. . GTGTTTGTT 
C511 GGTGCCCCAT GCCCTGCCGC TGCAGCTTGT CTGTCTTTT. GTGTCGTGTT 
" C512 GGTGCCCCGT GCCCTGCCGC TGCAGCCTGT CTGTCTTTTT GTGTTGTGTT 

901 950 
c51 GGTTCCTGTT CTCTGTCGAT CTATATTTAT CCTCATGTTT GTATGGCAGT 
c511 GGTTCCTGTT CTCTGTCGAT CTATATTTAT CCTCATGTTT GTATGGCAGT 
C512 GGTTCCTGTT CTCTGTCGAT CTATATTTAT CCTCCA31 

951 979 

c51 TTGGGTATGG GTATCA49 

C511 TTGGGTATGG GTATCAAACA TGTTTGCGA30 
c512 
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1 TGCCAGGCTC ACCGAGGGGT GCTCCTTCCG TCGCAAGGGC GATTAGATCC 50;^; 
ARL TEGC SF R RKG D* V 

51 TGCTGTCTTC ATGGGAGAAG AGAGAAATTT GCTGTTTTAC TACCTTCCCA 100 > 

101 TGATATGTAC TCGTTGAGGA TTTTGTTGAT TATTATGGCT GTTTAGCGTG 150 

151 CCCTGGCAAT GCTTTTGTAA ACGTGCACTT TGCTTGAGCT TAGTGACATC 200 

201 TACTAAGGTG CTGTTTGGTT TTGCTAAGGG AGTGGCAATG GTAATGAAAT 250 

251 CAGTTGCTGG CGTTAATTGT TGGGGTTTTA AAAAAAAAAA AAAAAAAAAA 300 

301 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 350 

351 AAAAAAAAAA AAAAAAAAAA AAAAAAAA 378 
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Fig • 17 



1 GCCGCCGCTA CGCGCTCGCC TTTGCTCCCT CTGTTTCCCT TCCTCCATAC 50 
51 AGGCGCAGGC CAGGAACGCA CAAGGCGACC GCATCTCCAA GATGGTGCTG 100 

M V L 

y 

101 CAGAACGACA TCGACTTGCT CAACCCGCCG GCAGAGCTTG AGAAGCTAAA 150 
QNDI DL L NPP AELE KLK 

151 GCACAAGAAG AAGCGCCTCG TCCAGTCCCC CAACTCCTTC TTCATGGATG 200 
H K K K R L V Q S P N S F F M D V 

201 TCAAGTGCCA GGGCTGTTTC AGCATAACCA CTGTGTTCAG CCACTCCCAG 250 
K C Q G C F" SIT T V F S H S Q 

251 ACTGTGGTTG TGTGCCCAGG CTGCCAAACT GTTCTCTGCC AACCTACTGG 300 
T V V V C P G C Q T V L C Q P T G 

301 TGGGAAGGCC AGGCTCACCG AGGGGTGCTC CTTCCGTCGC AAGGGCGATT 350 
GKA RLTE GCS FRR KGD* 

351 AGATCCTGCT GCCTTTTAAT GGGAGAAGAG AGAAATTTGC TGT TT TACAA 400 

401 CCTTCCCATG ATATGTACTC GTTGAGGATT TTGTTAATTA TTATGGCTGT 450 

451 TTAGCTTGCC CTGTCAATGC TTTTGTAAAC GTGCACTTTG CTTGGTGCTG 500 

501 TTTGGTTTTG CCAAGGGATT GGCAATGGTA GTGAZVATCAG TTGCTGACGT 550 

551 TAAAAAAAAA AAAAAAA 567 
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Fig . 18 



1 GAGCTCTCAA TGGGAGGTGC TCGAAGACAT ATTACCCAAG TGTATGGCAA 50 

51 GATGTTTAGC TAGTAACTGA CTGATAGTGT AAACGATCTC CAATGGGGCA 100 

101 AGACATATTA CCTAAGGCCA GGCTGGTTTT TGCAAGTTCG AGTAGGATAT 150 

151 AGAGATTCTC GTGCGAGTTG TAAACGATCT CCAATGGGGC AAGACATCCT 200 

201 ACCCTATATA TAGTGAAGGG GCAGTAGCTG ATTGAGAATC AATCAATCAA 250 

251 GCACAATATA ATTTATTAAT TTTTTATTCA AACCCAATTT TTTTCCTTTT 300 ; 

301 CCAACCCTAA TTATAGTTCT CCTTTTGCCT CTAGGAGAAA TTGACGTGTT 35° ' 

351 CCGGGTATCC TGCTGAATCA AGAACAACCC TAGGTGCACC TGTCCCGATA 400 

401 GAGTCCCACC TGGGTAGGCA TTCATAGGGA TTCGGGTATT TCCTGCAAAA 450 

451 AAGCGATTAA GCTGGCTTCT AA7KACTGGCT AGGCCGGATT CTGTGGCCTT 500 

501 CACTACCAGG TGATTT TCAT GTGATCCGTG CATTCTAGCA CTTTGCTATG 550 

551 TAACCCAAAC TGAAGTCGAC AACTATAAAT ATGCTACTTG CAGGATGTTA 600 

601 TCACGACACA ACTCCCAGTC TAGGAAGCCT AAGTTTAGTT TTGCTCGGAG 650 

651 ACAAGCAATT GTGGCCAGTC ACTATAGCTA CGTCAGAGGG TAGTGGGAGC 700 

701 AGTTGCGTCG TTGGATTGAA AACAGGTGGA TCGTATCAGA TATTATGCAT 750 

751 TCACATGAAC AGTAAATGTG GTACAGTACT TCGCAAACAA TAAAATCTGT 800 

801 CACAATTTAT TAGTGCACTC CTGTGACGTA AATGCTTCTA CGTCAGAGGA 850 

851 TTTGAGTCCG AGGGCTGCTG CACCCATCAC TAATGACGGT CTTTACCCAT 900 

901 CATCATGGAC CATTGTTCAC ATCCATGCTA TCACTGTCGT CCTGTCCATG 950 

951 CACTGCAGCC CTCTATAAAT ACTGGCACCC CTCCCCCGTT CACAGATCAC 1000 

1001 ACGACACAAG CAAGAAATAA ACGGTAGCTG CCATAACTAG TACAATG 1047 
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Fig. 19 

1 CTGCAGAACG ACATCGACTT GCTCAACCCG CCGGCAGAGC TTGAGAAGCT 50 

51 AAAGCACAAG AAGAAGCGCC TCGTCCAGTC CCCCAACTCC TTCTTCATGG 100 

101 TATACCTCTT CATGCCCTTT TCTAATTCTG CGTGATGGGG TGTTGCGAAT 150 

151 CATGCCCTAA AGAGACCCAA CGAJCTGTATC ATTGGAATTT AGTTTTGGCA 200 

201 AAATAAAGAC CTGCACCTAA TAGGGTGATT TGGTTTGGAA GAGAGTCCAG 250 

251 AAATTAGATT TAG T ATCGTT AACTACATCA AAAATTACTT CCAACAAAGC 300 

301 AACAAGACAG CTTAACTGCT CTTGTTTGCC ATGTAACATA CATGTAGTCA 350 

351 AGCTGGATGC TATAGGTTAT GCATTTTTCA TCAAATAAGT TGTGTTTATA 400 

401 AGAGAGCTAA AATGTGAAAA CAGAAATGAA AATGAATCTT AAAGTTACAA 450 

451 TGTATCGACT TATATTCATA TTAATTAATA TTAATATTAG TCGTGTCTGC 500 

501 TCTTGTGCCT GATCGGAGAC TGACTCTAAT CATGTTATTT TTCGTTTTTT 550 

551 TTCTCCTTTG TGTGCAGGAT GTCAAGTGCC AGGGCTGTTT CAGCATGTAA 600 

601 GTGTTCATGA TTCTTGCTCC TTTTTTGTTT TTAGTTACTA CAGTTGCAAT 650 

651 TGATATAGAC GCCGTGTCAT TCCATTTATC CATTGCTGAT TGCTGATGTG 700 

701 ATCTCCTAAT CTTGTGTTAC TGGTTTTCCA TTGACAGTTT AGCATGATCA 750 

751 GTTTACTAGT AGGGTTGCGA TTACTACTAG CTAGTTAAAA TATGAAAATT 800 

801 CTTGGTTTAG TTATAAGGTT ATATATTGAT TTCGTAAAAT TTCATACCCC 850 

851 CTCACTTCCC GAATATGTAG GACAACAAAA TTGTTTATAG ATGGAGATGG 900 

901 AAACTGTTAA TGTTTGACTC CTGTTTTTGT ;:TTCTTTCTCA CCTGACCAGT 950 

951 ACAAGTACAA TTGTTCTGTT TAAATGTGGG TTAATTTGGA TTCAACAACA 1000 

1001 ACAACAACAA CAACATAGCC TTTTGTCCCA AGCAAGTTGG GGTAGGCTAG 1050 

1051 AGATGAAACC CAAAAGAGAC GAGAAACAGG GAGACACAAC GTTACACCTC 1100 

1101 GCTGCTTTAT TTGGATTACG CTTGTATTTC TTTTGATAAT TGCACAACGT 1150 

1151 TACACCTAGC TGCTTTATTT GCTGCTTGCA AGTGTAGCTG GTTTGATCAT 1200 

1201 GTTGTGTACG AGTTCTAATT TACGTGTGAC CACTAAGCTT TACTGCAATT 1250 

1251 ATCTGCGATC TTTAAATGTT ATCCTTTGTT GGAAGGTTAT TATGGTTGTA 1300 

1301 TAGCTTCTGT GATATCACGA TTCTGAACAA ATCAAATGTT TGCTGTAGCT 1350 

1351 TACAAAGTTT TTGTGATTGC AGAACCACTG "TGTTCAGCCA CTCCCAGACT 1400 

1401 GTGGTTGTGT GCCCAGGCTG CCAAACTGTT CTCTGCCAAC CTACTGGTGG 1450 

1451 GAAGGCCAGG CTCACCGAGG GGTGCTCCTT CCGTCGCAAG GGCGATTAGA 1500 

1501 TCCTGCTGTC TTTTAATGGG AGAAGAGAGA AATTTGCTGT TTTACAACCT 1550 

1551 TCCCATGATA TGTACTCGTT GAGGATTTTG TTAATTATTA TGACTGTTTA 1600 

1601 GCTTGCCCTG TCAATGCTTT TGTAAACGTG CACTTTCCTT GAGCTTAGTG 1650 

1651 ACATCTACTT AAGG TGCTGT TTGGTTTTGC CAAGGGATTG GCAGTGGTAG 1700 

1701 TGAAATCAGT TGCTGACGTT ATTTGTTGGG GTTTAACTGA AAGTGTCATT 1750 

1751 TGTTTAGTTG CCTTGTGTAA TGGATTCAAT TTAACAACTG ACTAGTACGG 1800 

1801 TCTCTAGCCC'AATGCAGCGT TCACTCTCAG CGGCAGATTG TTCTGTAGCA 1850 

1851 GTTAACGGTC ATAGGAGTGC CCTCGCGCTC GCCTTTGTCC CCTCTCTCCC 1900 

1901 TTCCTCCTAT GGAGGAACGC CCGAGGCGAC CGCATCTCGA AGATG 1945 
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151 


TAGATAGCAA CACCACGAAT CAGTCTTCTA AAACTTCACT AGTTTATTAT 


200 
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CACCAGCGGA TGCAAACAAA GCTCCACTCA CACTGAAGCA AATAGCTCTT 


250 


251 


ATAGCGTCTG AATGAGAACG ACCACCACAA TCATCTGACA ATGAAACTGG 


300 


301 


ATGACCAGCC CTGGAGGAAC AGAGTAAATA TTTACTTGCA CATTCCACCA 


350 


351 


AATGAAATGC GGAGAAAA6G CGCTCCACTT CTCACTGAAA CATGTGAACT 


400 


401 


CAGGATTCAA CAATATAAAA TTAAGCAACC AGGACCTGAT TTCGTTAGTA 


450 


451 


ACAATTTACA CAACCTGACT CACAAGAATT CCATTGTTTT CTTAAAGAAA 


500 


501 


ACTTTCCATT CCTCTCCACA CGAAACTAAG TTATAAGAGA AAACAACTAA 


550 


551 


CGATAAGCAG CAGATAGATT CAGTTCAGAA ACGCACTACA ACCTCCACCA 


600 


601 


AAATCTACCA ATCGTGTAAT CAAACACTAG CACTTTCTGC ATAAGAGGTC 


650 


651 


CATGTTTAAT TAGGAAGCGG TGATTGAAGC GGGTATTCCA ATTTCGAAAC 


700 


701 


CTGAGATCGA ACACGCGGAG CTCGGGCCCA ATGGCGACGG CGACGGAGTT 


750 


751 


GCCGTGCGGG TGGGCGCGAC GAGCGCCGGC GCGAACTCCG CCGCGCCGCT 


800 


801 


TACCCTCGGG CTCTTCGATC GCTGTGTCCT CCATTGCAGC GGCAGCCGCA 


850 


851 


AGATAGGGAG CTCGGATTCG AATCGAGCGC GAGACGGAGQ CGGGGAGGGA 


900 


901 


TTCAGGGATT AGGGTT TACA GGCAGTCGCA CATGGGGCCC AGAGACCAGT 


950 


951 


GTCACGAAGG AGAGCCCGGA TGGGTTGCAG TTGCAGGCTT GCAGCCCAAA 


1000 


1001 


GGGCAAAAGC CTTTGGTTTG TCGGTCATGG GCCTCCACGA AAACTGTCTC 


1050 


1051 


TGTCGGACTG CCCAATCCCA CGGAAGGCCG AGATGAACGC AGCACCTCGA 


1100 


1101 


TGAACGCCTC AGATTCGCCA ACCCACACGA CCGCACCTAT ATAAAGTATC 


1150 


1151 


CACCCTCCGC TCCCGTCTTC CTCTCCGCCG CCGCCACGCG CTCGCCTTTG 


1200 


1201 


CTCCCTCTGT TTCCCTTCAT AGAAAGGCGC AGGCCAGGAA CGCCCAAGGC 


1250 


1251 


GACCGCATCT CCAAGATG 1268 
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Fig. 21 



1 CTGCAGCTAG CTGATGATCG AGACTCCGTT TGAACTTTGA AATGTACGAA 
51 TTTTCTTGAA CCATGTGAAA TCGGAGTCAT TTTGCACAAA ACTTGTGAAT 
101 GAAGATCCAA ATATTCCTAA CGTATAGAAC TTCAAAGAAA GCTGGAGAAA 
151 ATGAAAGAAA GATCACTTAT AGAAAAAAAA AATGAAACTG AACTTTGCCT 
201 TGCCACCGAT GACATCAAAA GCATTTTCGC CTAATGCTTG TAGTGGACCG 
251 GCCTTGAGCT AGCTGCCATG TGATCGCTGA TCCTTTCGGC AGTGATGAGC 
301 TAGCTATGCT ACTTCACTGA AGCGATGATG AGGTGTCTCG CTGCCCCCAT 
351 TGCGGTGTAC CACAAAACGA GTCTGAGCTC GCTGTCCTGC TGATGGCATC 
401 CTTACTTGCT TTTTCCATTA TTTTGCAAGG CAAAGTTGAT CCATGGACAA 
451 CTACTCCCGC AGAACAGTTC AATGGGCTCA AATATTTCTA TGCTAGCTTT 
501 TTCGATAAAG GTGGTGGTGC CTAATGTTGT CTAAAGCAAG GAGACGGACT 
551 TGACCCAAAG TTGATAAGGG TCTCATCCAT TTGCCTTGAT TAAGCGGAAC 
601 AAGACACTTG ATAGGAATAG GTTTGGTTTT TACCTAAAAT GATGCACATG 
651 AAAGTTATTT TTTGTCAAAC TCCAAATCCT CAAATAGCTT ACCAAAGTTT 
701 TGGCCAAATT TAGATTTGAA AATAAAGTAT AGTGTCGAAT AATTATGTTG 
751 TCTACCTAAA CTTTTTTTCC ATCAAATAAA AGTTCAGAGT TTTTAGTGGG 
801 TGGTGATTGT TATATAGGGG GTCGACACGG AGCTCTTTTA ATGAACTAAT 
851 CTAAGTTTTC TAATAATCTA TATCTAATAT CTGTCATCCT TTGTCCCTTA 
901 CAACTGTCAG ATGGAGATTT GACGAACTCA ATCCCTTCAA TTCTTATACC 
951 CATACAAGCT AGAGCGACAC GCATCTGGGG CACACTGTGG TGTTCGATTT 
1001 GCAAACGAAT TGAAACGCAT GATGACATGA TCGCTAAATA AATCTCCAAG 
1051 CCGCAAGTCT TCTAGCAAGT AACGACGCAA GAAGTTAATT GTCTTATTGC 
1101 AGCGCACGGG TATATTTGCT AGTTATATTA ATAAGAGAAA ATTTCTTCAT 
1151 CCAAATTTTT GTTCGTCCCT CTCTCCCGAT CCATGTAACT GTCAACTCCC 
1201 TTGAGGGCCA ACAATAAGAG AATAGTGGTA CGTGATGAAT TAAATATAAG 
1251 GGTTCTAATA GGTTTACTGT TTTGTGTTGT GTCAAACTCA CCGCCGCGCC 
1301 CATATACGAT TCAACTAATC TTTGGATTAC CGTAACTTGA CCTGACTGTA 
1351 TCAAAACTCC TTTTATTCTG CTTAATGAAA TACGTGCTAA AGCACGATCT 
1401 CGAAAAAAAA CTACAGCACA GTGTCCAATT TCAAGATATA TTAGAGCAAA 
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Fig. 21 cont'd 
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